ybs Subtitles
Solving Sentence Break Issues
Users who have used YouTube's automatically generated subtitles know that there has always been an issue with sentence breaks. These subtitles lack punctuation, different sentences stick together, and break off at some point (as shown in the main subtitle below). Since YouTube's subtitle translation is based on these broken sentences, it leads to poor translation quality.
We use machine learning to fix these subtitles, recombining the broken sentences, greatly improving the translation quality.
Generation Speed
Since ybs subtitles are based on existing subtitles, this makes their generation extremely fast.
For example, for the 19-hour tutorial video of freecodecamp, we only need 30 seconds to generate all subtitles.
For the 48-minute video of Melanie Nakagawa, Chief Sustainability Officer at Microsoft, ybs takes 12 seconds, while whisper takes 3 minutes and 22 seconds (using RTX 4090)
Supported Languages
Here are the languages supported by ybs subtitles:
- Arabic https://www.youtube.com/@AlArabiya
- Bengali https://www.youtube.com/@republicbangla
- Bulgarian https://www.youtube.com/@bTVMediaGroup
- Czech https://www.youtube.com/@tvnovaofficial
- Danish https://www.youtube.com/@p3essensen
- Dutch https://www.youtube.com/@nosop3
- English https://www.youtube.com/@ABCNews
- French https://www.youtube.com/@FRANCE24
- Persian https://www.youtube.com/@afintltv
- Filipino/Tagalog https://www.youtube.com/@GMARegionalTV
- Finnish https://www.youtube.com/@psverkkomedia
- German https://www.youtube.com/@WELTVideoTV
- Greek https://www.youtube.com/@SKAIgr
- Gujarati https://www.youtube.com/@News18Gujarati
- Hebrew https://www.youtube.com/@now14
- Hindi https://www.youtube.com/@knh9443
- Hungarian https://www.youtube.com/watch?v=fOpIfaBprDU
- Indonesian https://www.youtube.com/@tvOneNews
- Italian https://www.youtube.com/@euronewsit
- Japanese https://www.youtube.com/@ntv_news
- Kannada https://www.youtube.com/@tv9kannada
- Korean https://www.youtube.com/@MBCNEWS11
- Latvian https://www.youtube.com/c/LTVZiņudienests
- Lithuanian https://www.youtube.com/@LRTinklas
- Malayalam https://www.youtube.com/@24OnLive
- Marathi https://www.youtube.com/@24OnLive
- Norwegian https://www.youtube.com/@tvnorge
- Polish https://www.youtube.com/@Telewizja_Republika
- Portuguese https://www.youtube.com/@euronewspt
- Punjabi https://www.youtube.com/@TheKhalasTv
- Romanian https://www.youtube.com/@StirileProTV
- Russian https://www.youtube.com/@ictv
- Slovak https://www.youtube.com/@Aktuality_sk
- Spanish https://www.youtube.com/@Aktuality_sk
- Swedish https://www.youtube.com/@SVTHumor_
- Tamil https://www.youtube.com/@thanthitv
- Telugu https://www.youtube.com/@ETVAndhraPradesh
- Thai https://www.youtube.com/@ThaiPBS
- Turkish https://www.youtube.com/@TürkçeHaber
- Ukrainian https://www.youtube.com/@tsn
- Vietnamese https://www.youtube.com/@antvtruyenhinhcongannhandan
AI Repair
Since ybs subtitles are generated based on automatically generated subtitles, the quality of voice recognition technology used by YouTube directly affects the quality of ybs subtitles. Some languages have lower error rates, like English, while others frequently have errors, like Japanese. Currently, our solution is to use AI to repair these subtitles, which can fix some more obvious errors such as grammatical mistakes or missing vocabulary, but it may also change the sentence structure and wording.