Automatic Captions Rolling Out On YouTube

11:27 am on Nov 20, 2009 (gmt 0)

Administrator from GB 

joined:May 9, 2000
votes: 999

Automatic Captions Rolling Out On YouTube [news.bbc.co.uk]
YouTube's parent company Google has announced on its blog that automatic captions are to begin to roll out across the site. The machine-generated captions will initially be generated in English. At first they will only be found on 13 channels. These include National Geographic, Columbia, as well as most Google and YouTube channels.

The software engineer behind the technology, Ken Harrenstien, is deaf. Currently YouTube offers a manual captioning service but video makers tend not to use it.

"The majority of user-generated video content online is still inaccessible to people like me," Mr Harrenstien wrote in the Google blog.

11:55 am on Nov 20, 2009 (gmt 0)

Moderator from GB 

joined:June 15, 2001
votes: 93

I think in cases like the post mentioned there are very valid reasons for captions. But in most cases the existing caption tools are being abused.


5:29 pm on Nov 20, 2009 (gmt 0)

Senior Member

joined:Apr 25, 2002
votes: 283

If they are using the same technology that Google Voice uses to transcribe my phone messages, this will be all but useless.

At best, I can often grasp general context from a phone message. It tends to translate names into other words. So "This is Tommy" was transcribed as "This is call me". It does fairly well on stop words and numbers, but tends overwhelmingly to miss the words of significance.

It's fun to guess though. I actually guessed based on area code and the few correct words of significance that "call me" was "Tommy".

Still, there's a long way to go for this to be a significant help to the deaf. Professional audio from a professional announcer who doesn't say "um" a lot (a killer on Google Voice transcriptions) might help though.

Assuming they roll it out more broadly, it raises an interesting question. Four, actually:

1. Is Google using captions in the index currently?

2. Is this also the first step to enabling search of audio and video content beyond title, tags and surrounding text?

3. Will they be able to implement duplicate content filters eventually? I know there are people who use the same video many times, with different tags, titles and surrounding text. My understanding is that currently, if you change the length by a second or two, it will not be seen as the same video.

4. Does this make it more important to add voiceover to your videos instead of just my dramatic video of birds diving underwater?
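
On question 3, here is a rough sketch of how caption text could feed a duplicate filter. This is only my assumption about one possible approach (word shingles compared with Jaccard similarity), not anything Google has described. The function names and the sample transcripts are made up for illustration.

```python
import re


def shingles(text: str, k: int = 3) -> set:
    """Return the set of k-word shingles (overlapping word runs) in a transcript."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < k:
        # Too short to shingle: treat the whole text as one unit.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def transcript_similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard similarity of the two shingle sets, from 0.0 to 1.0."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa or not sb:
        # No words on one side means no evidence of duplication.
        return 0.0
    return len(sa & sb) / len(sa | sb)


# Hypothetical transcripts: the second is the same clip with the first
# second trimmed and a word of extra audio at the end.
original = "the birds dive underwater to catch fish near the rocky shore"
trimmed = "birds dive underwater to catch fish near the rocky shore today"
unrelated = "this is tommy calling about the invoice please call me back"

score_trimmed = transcript_similarity(original, trimmed)
score_unrelated = transcript_similarity(original, unrelated)
```

The point is that trimming a second or two changes the video length but leaves most of the spoken words in place, so the transcript overlap stays high. It would only be as good as the transcription itself, and it does nothing for videos with no speech.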