Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for WebVTT and MicroDVD #462

Open
Diaoul opened this issue Jul 13, 2015 · 3 comments
Open

Add support for WebVTT and MicroDVD #462

Diaoul opened this issue Jul 13, 2015 · 3 comments

Comments

@Diaoul
Copy link
Owner

Diaoul commented Jul 13, 2015

Will require to switch to pycaption for validation
Not compatible with python 3, abandoned project?

Want to back this issue? Post a bounty on it! We accept bounties via Bountysource.

@Diaoul Diaoul changed the title Add support for TTML Add support for WebVTT and MicroDVD Nov 6, 2015
@Diaoul
Copy link
Owner Author

Diaoul commented Nov 6, 2015

@Toilal @wackou: I want to create my own robust subtitle parser and will likely create a new library for that that handles various formats.
I'm looking for the right tool for the job, all subtitles formats seem to have a defined grammar that makes parsing easily possible. There are various technologies for that (PEG parsers, lexers such as LEX or YACC) and so on. Would you recommend one for that kind of work?

I saw various tools such as pyparsing, PLY, pyPEG and parsimonious.
I wonder if rebulk would be able to do that? There's no decision making so I think it's not the right tool. There is also the possibility to have my own basic parser based on str and re.

Ideas are welcome 🍥

@Toilal
Copy link
Contributor

Toilal commented Nov 6, 2015

Do you have examples and/or specs for those formats ?

Rebulk can be used for "short input" and "pseudo-natural" language. I don't think it's the write tool to parse a structured file. It's designed to define patterns (string, regex or functional) than will be scanned in the whole input string, retrieve consistent match objects from those different type of patterns, and filter out false positives with rules implying relations between those matches.

I've never used mentioned parsers in python sorry :)

@Diaoul
Copy link
Owner Author

Diaoul commented Nov 6, 2015

@Diaoul Diaoul added this to the 1.2 milestone Dec 30, 2015
@Diaoul Diaoul modified the milestones: 1.2, Later Mar 8, 2016
@getzze getzze mentioned this issue Jun 18, 2024
18 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants