Giter VIP home page Giter VIP logo

Comments (3)

anshul1912 avatar anshul1912 commented on May 14, 2024

It is possible only if the video have word by word timing inside, do you have some video where each words of some lines are shown at different time.

like below caption
"This is caption"
if first "This" on x second is shown
then "is" on x+1 second shown
then "caption" on x+2 second is shown

I have never seen a video like that, though people do make cc in such a way where they
show "This" then "This is" then "This is caption", here data is redundant but people's are using it for effect.

In closed caption timing is generally taken from PES packet which contain closed caption, so if your each word is in different PES packet then you can get that timing, I don't think there are any sane closed caption encoder who display one word at a time with each frame, that decrease readability of those statement and it would not be useful too.

can you elaborate why are you interested in identifying the precise in and out timestamps of specific words

from ccextractor.

cfsmp3 avatar cfsmp3 commented on May 14, 2024

For captions transmitted in roll-up we could have word-by-word timing
(since characters are displayed as received); however in roll-up, which is
used mostly for newscasts and other content transcribed in real time,
there's no lipsync (captions are at least a couple seconds behind audio) so
there's no value in doing that either.

On Sun, Nov 9, 2014 at 8:35 AM, Brannon Dorsey [email protected]
wrote:

Hi there,

I am interested in identifying the precise in and out timestamps of
specific words embedded in the closed caption data of an mpeg2 stream. It
seems that with CCExtractor, only lines of text are indexed in this way. Is
this a limitation of CCExtractor specifically, or the standards of CC in
digital broadcast? If this functionality is not directly built into
CCExtractor, would you have any suggestions as to how to extract and use
this very specific data?


Reply to this email directly or view it on GitHub
#120.

from ccextractor.

brannondorsey avatar brannondorsey commented on May 14, 2024

Hi all,
Thank you both for your timely responses! It seems as if no closed captioning will provide me with the level of control that I am looking for in tagging words in television programs. I am looking to create a database of precise in and out points of words in network TV and Movies. That said, CCExtractor is a really fine piece of software.

from ccextractor.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.