Transcription software: Difference between revisions

Revision as of 19:08, 2 October 2023

Transcription software assists in the conversion of human speech into a text transcript. Audio or video files can be transcribed manually or automatically.^[1] In manual transcription, transcriptionists replay a recording as many times as needed in a transcription editor and type what they hear. Transcription hot keys speed up this work, and when a recording is unclear the sound can be filtered or equalized, or its tempo adjusted. With speech recognition technology, transcriptionists can convert recordings to text automatically by opening them on a PC and uploading them to a cloud service, or transcribe speech in real time by using digital dictation. Depending on the quality of the recording, machine-generated transcripts may still need to be verified manually. The accuracy of automatic transcription depends on several factors, such as background noise, the speakers' distance from the microphone, and accents.
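
The article does not name a metric for the accuracy of a machine-generated transcript. One widely used measure is the word error rate (WER): the number of word substitutions, deletions and insertions needed to turn the automatic transcript into a manually verified one, divided by the number of words in the verified transcript. The following is a minimal sketch under that assumption; the function name and the lowercase, whitespace-based tokenization are illustrative choices, not part of any particular product.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    `reference` is the manually verified transcript and `hypothesis`
    is the machine-generated one (both names are illustrative).
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far (minus the current one) and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    verified = "the cat sat on the mat"
    automatic = "the cat sat on a mat"
    print(f"WER: {word_error_rate(verified, automatic):.3f}")
```

A WER of 0 means the two transcripts match word for word. Because insertions are counted, the value can exceed 1 when the automatic transcript is much longer than the verified one.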

Transcription software, as with transcription services, is often provided for business, legal, or medical purposes. Compared with audio content, a text transcript is searchable, takes up less storage space, and can be used as an alternative method of communication, such as for closed captions.

Transcription "software" differs from a transcription "service" in that the former is sufficiently automated that a user can run the entire system without engaging outside personnel. However, the advent of software-as-a-service and cloud computing models blurs this distinction. Automatic transcription software uses artificial intelligence, machine learning and natural language processing to convert speech to text and to continuously learn new phrases and accents.^[2]

Development

Research at Google released Google Live Transcribe, a free Android app that runs on Google Cloud.^[3]^[4] Google Chrome includes a built-in Live Caption feature, available in English.^[5] Google Docs, Google Translate, Google Assistant, Gboard and the Google Text-to-Speech engine also support transcription.^[6]^[7]^[8]^[9]

OpenAI launched Whisper, an open-source deep learning model for speech recognition, in September 2022.^[10]
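
As an illustration of automatic transcription with an open-source model, the sketch below assumes the `openai-whisper` Python package and the `ffmpeg` tool are installed. `whisper.load_model` and `model.transcribe` are calls from that package. The helper names `format_segments` and `transcribe_file`, the file name `interview.mp3` and the choice of the `"base"` model size are hypothetical and used only for this example.

```python
def format_segments(segments):
    """Render Whisper-style segments as '[start-end] text' lines.

    Each segment is expected to be a dict with 'start', 'end'
    (in seconds) and 'text' keys.
    """
    lines = []
    for seg in segments:
        lines.append(
            f"[{seg['start']:.1f}s-{seg['end']:.1f}s] {seg['text'].strip()}"
        )
    return "\n".join(lines)


def transcribe_file(path, model_name="base"):
    """Transcribe an audio file and return (full_text, timestamped_lines).

    Assumes `pip install openai-whisper` and ffmpeg on the PATH.
    The import is local so the formatting helper can be used without it.
    """
    import whisper

    model = whisper.load_model(model_name)
    result = model.transcribe(path)
    return result["text"], format_segments(result["segments"])


if __name__ == "__main__":
    # "interview.mp3" is a placeholder file name for this sketch.
    text, timeline = transcribe_file("interview.mp3")
    print(text)
    print(timeline)
```

The timestamped lines make it easier to find the matching point in the recording when a machine-generated transcript is verified manually.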

References

[1] "Transcription Functions | Transcribear". General Transcription Functions and Conventions, Audio Transcriptions. 2017-06-08. Retrieved 2019-02-15.
[2] Bhatt, Medha. "What is AI Transcription? Everything You Need to Know". fireflies.ai. Retrieved 2022-06-03.
[3] "Use Live Transcribe - Android Accessibility Help". support.google.com. Retrieved 2021-06-14.
[4] Butler, Sydney (2019-12-09). "How to transcribe speech using Google's Live Transcribe app". 9to5Google. Retrieved 2021-06-14.
[5] "Google Chrome's new Live Caption feature will transcribe speech in videos". techxplore.com. Retrieved 2021-06-14.
[6] "Now you can transcribe speech with Google Translate". Google. 2020-03-17. Retrieved 2021-06-14.
[7] Krasnoff, Barbara (2020-08-14). "How to use Google's free transcription tools". The Verge. Retrieved 2021-06-14.
[8] "Live Transcribe & Sound Notifications - Apps on Google Play". play.google.com. Retrieved 2021-06-14.
[9] "Google Rolling Out Real-Time Transcription and Translation for Gboard Users". Retrieved 2021-06-14.
[10] Golla, Ramsri Goutham (2023-03-06). "Here Are Six Practical Use Cases for the New Whisper API". Slator. Archived from the original on 2023-03-25. Retrieved 2023-08-12.

@@ Line 1: / Line 1: @@
 {{Short description|Software that assists in the conversion of human speech into a text transcript}}
 {{more citations needed|date=January 2017}}
-'''Transcription software''' assists in the conversion of human speech into a text transcript. Audio or video files can be transcribed manually or automatically.<ref>{{Cite news|url=https://transcribear.com/transcription.asp|title=Transcription Functions {{!}} Transcribear|last=|first=|date=2017-06-08|work=General Transcription Functions and Conventions, Audio Transcriptions|access-date=2019-02-15}}</ref> Transcriptionists can replay a recording several times in a transcription editor and type what they hear. By using transcription hot keys, the manual transcription can be accelerated, the sound filtered, equalized or have the tempo adjusted when the clarity is not great. With [[speech recognition]] technology, transcriptionists can automatically convert recordings to text transcripts by opening recordings in a PC and uploading them to a cloud for automatic transcription, or transcribe recordings in real-time by using [[digital dictation]]. Depending on quality of recordings, machine generated transcripts may still need to be manually verified. The accuracy rate of the automatic transcription depends on several factors such as background noises, speakers' distance to the microphone, and accents.
+'''Transcription software''' assists in the conversion of human speech into a text transcript. Audio or video files can be transcribed manually or automatically.<ref>{{Cite news|url=https://transcribear.com/transcription.asp|title=Transcription Functions {{!}} Transcribear|last=|first=|date=2017-06-08|work=General Transcription Functions and Conventions, Audio Transcriptions|access-date=2019-02-15|url-status=live}}</ref> Transcriptionists can replay a recording several times in a transcription editor and type what they hear. By using transcription hot keys, the manual transcription can be accelerated, the sound filtered, equalized or have the tempo adjusted when the clarity is not great. With [[speech recognition]] technology, transcriptionists can automatically convert recordings to text transcripts by opening recordings in a PC and uploading them to a cloud for automatic transcription, or transcribe recordings in real-time by using [[digital dictation]]. Depending on quality of recordings, machine generated transcripts may still need to be manually verified. The accuracy rate of the automatic transcription depends on several factors such as background noises, speakers' distance to the microphone, and accents.
 Transcription software, as with [[transcription (service)|transcription services]], is often provided for business, legal, or [[medical transcription|medical purposes]]. Compared with audio content, a text transcript is searchable, takes up less computer memory, and can be used as an alternate method of communication, such as for [[Closed captioning|closed captions]].
@@ Line 8: / Line 8: @@
 == Development ==
-Research at Google released a free android app [[Google Live Transcribe]], it runs on [[Google Cloud]].<ref>{{Cite web|title=Use Live Transcribe - Android Accessibility Help|url=https://support.google.com/accessibility/android/answer/9158064?hl=en|access-date=2021-06-14|website=support.google.com}}</ref><ref>{{Cite web|last=Butler|first=Sydney|date=2019-12-09|title=How to transcribe speech using Google's Live Transcribe app|url=https://9to5google.com/2019/12/08/how-to-transcribe-speech-using-googles-live-transcribe-app/|access-date=2021-06-14|website=9to5Google|language=en-US}}</ref> [[Google Chrome]] developed  and has a available built in English Live Caption.<ref>{{Cite web|title=Google Chrome's new Live Caption feature will transcribe speech in videos|url=https://techxplore.com/news/2021-03-google-chrome-feature-speech-videos.html|access-date=2021-06-14|website=techxplore.com|language=en}}</ref> [[Google Docs]], [[Google Translate]], [[Google Assistant]], [[Gboard|GBoard]] [[Google Text-to-Speech|Google Text to Speech engine]] support transcription tool too.<ref>{{Cite web|date=2020-03-17|title=Now you can transcribe speech with Google Translate|url=https://blog.google/products/translate/transcribe-speech/|access-date=2021-06-14|website=Google|language=en-us}}</ref><ref>{{Cite web|last=Krasnoff|first=Barbara|date=2020-08-14|title=How to use Google's free transcription tools|url=https://www.theverge.com/21368867/transcription-google-docs-live-transcribe-how-to-zoom|access-date=2021-06-14|website=The Verge|language=en}}</ref><ref>{{Cite web|title=Live Transcribe & Sound Notifications - Apps on Google Play|url=https://play.google.com/store/apps/details?id=com.google.audio.hearing.visualization.accessibility.scribe&hl=en_US&gl=US|access-date=2021-06-14|website=play.google.com|language=en}}</ref><ref>{{Cite web|title=Google Rolling Out Real-Time Transcription and Translation for Gboard 
Users|url=https://www.digitalinformationworld.com/2020/08/google-rolling-out-real-time-transcription-and-translation-for-gboard-users.html|access-date=2021-06-14}}</ref>
+Research at Google released a free android app [[Google Live Transcribe]], it runs on [[Google Cloud]].<ref>{{Cite web|title=Use Live Transcribe - Android Accessibility Help|url=https://support.google.com/accessibility/android/answer/9158064?hl=en|access-date=2021-06-14|website=support.google.com}}</ref><ref>{{Cite web|last=Butler|first=Sydney|date=2019-12-09|title=How to transcribe speech using Google's Live Transcribe app|url=https://9to5google.com/2019/12/08/how-to-transcribe-speech-using-googles-live-transcribe-app/|access-date=2021-06-14|website=9to5Google|language=en-US}}</ref> [[Google Chrome]] developed  and has a available built in English Live Caption.<ref>{{Cite web|title=Google Chrome's new Live Caption feature will transcribe speech in videos|url=https://techxplore.com/news/2021-03-google-chrome-feature-speech-videos.html|access-date=2021-06-14|website=techxplore.com|language=en}}</ref> [[Google Docs]], [[Google Translate]], [[Google Assistant]], [[Gboard|GBoard]] [[Google Text-to-Speech|Google Text to Speech engine]] support transcription tool too.<ref>{{Cite web|date=2020-03-17|title=Now you can transcribe speech with Google Translate|url=https://blog.google/products/translate/transcribe-speech/|access-date=2021-06-14|website=Google|language=en-us}}</ref><ref>{{Cite web|last=Krasnoff|first=Barbara|date=2020-08-14|title=How to use Google’s free transcription tools|url=https://www.theverge.com/21368867/transcription-google-docs-live-transcribe-how-to-zoom|access-date=2021-06-14|website=The Verge|language=en}}</ref><ref>{{Cite web|title=Live Transcribe & Sound Notifications - Apps on Google Play|url=https://play.google.com/store/apps/details?id=com.google.audio.hearing.visualization.accessibility.scribe&hl=en_US&gl=US|access-date=2021-06-14|website=play.google.com|language=en}}</ref><ref>{{Cite web|title=Google Rolling Out Real-Time Transcription and Translation for Gboard 
Users|url=https://www.digitalinformationworld.com/2020/08/google-rolling-out-real-time-transcription-and-translation-for-gboard-users.html|access-date=2021-06-14}}</ref>
+OpenAI launched [[Whisper (speech recognition system)|Whisper]], an open-source [[speech recognition]] [[deep learning]] model in September 2022.<ref>{{Cite web |last=Golla |first=Ramsri Goutham |date=2023-03-06 |title=Here Are Six Practical Use Cases for the New Whisper API |url=https://slator.com/six-practical-use-cases-for-new-whisper-api/ |url-status=live |archive-url=https://web.archive.org/web/20230325214704/https://slator.com/six-practical-use-cases-for-new-whisper-api/ |archive-date=2023-03-25 |access-date=2023-08-12 |website=Slator |language=en-US}}</ref>.
-Whisper is an automatic speech recognition (ASR) system developed by [[OpenAI]]. Launched in September 2022,<ref>{{Cite web |title=Introducing Whisper |url=https://openai.com/research/whisper}}</ref> this deep learning model is specifically trained on low-quality data to achieve higher accuracy in speech recognition tasks. The model's design allows it to interpret and transcribe audio data that may be challenging for other ASR systems.
-In an effort to promote collaboration and innovation within the scientific community, OpenAI initially released the code for Whisper as open-source. Subsequently, an API was also made available, providing developers and researchers with a platform to integrate and experiment with the model in various applications.
-A demonstration of Whisper's capabilities can be seen at [https://listenmonster.com/ ListenMonster], showcasing the model's proficiency in handling various speech recognition tasks. The public release of both the code and API underscores OpenAI's commitment to advancing the field of artificial intelligence through transparency and accessibility
 ==See also==