Various voice transcription services can be used to process multimedia data (the audio track) and produce a text transcript from it. The degree of success varies, and in most cases requires manual postprocessing by human. Services like VoiceBase are good choice for this type of conversion as it supports multiple formats as inputs, including Windows Media Video (.wmv).
Open WMV file Open TXT file