Recursively parse dom objects to identify repeated patterns

drrobotnik/recursive-dom-parser

recursive-dom-parser

Recursively parse dom objects to identify repeated patterns

What does this do?

The idea here is to traverse a scraped website and parse every DOM element in each HTML file. Comparing elements across pages lets you identify markup that is reused, which makes it a good candidate for extraction into a component.

python soup.py --path="../../scraped-site/" --element=div > div.html
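
The command above writes its report for one element type to a file. The source of soup.py is not shown here, so the following is only a minimal sketch of the general idea, not the script's actual implementation. It assumes "repeated pattern" means "same tag, same classes, same nested child structure", and it uses the standard library's html.parser instead of Beautiful Soup so that it runs without extra dependencies. The names Node, TreeBuilder, and count_patterns are hypothetical.

```python
import os
from collections import Counter
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not stay on the stack.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class Node:
    """One element in a simplified DOM tree (hypothetical helper)."""

    def __init__(self, tag, classes):
        self.tag = tag
        self.classes = classes
        self.children = []

    def signature(self):
        """Recursively describe the element's structure, ignoring text."""
        name = self.tag + "".join("." + c for c in self.classes)
        inner = ",".join(child.signature() for child in self.children)
        return f"{name}[{inner}]"


class TreeBuilder(HTMLParser):
    """Builds a Node tree and keeps a flat list of every element seen."""

    def __init__(self):
        super().__init__()
        self.stack = [Node("#root", [])]
        self.nodes = []

    def handle_starttag(self, tag, attrs):
        # A bare `class` attribute has the value None, hence the `or ""`.
        classes = sorted((dict(attrs).get("class") or "").split())
        node = Node(tag, classes)
        self.stack[-1].children.append(node)
        self.nodes.append(node)
        if tag not in VOID:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open tag; ignore stray closers.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break


def count_patterns(path, element):
    """Count structural signatures of `element` in all HTML files under path."""
    counts = Counter()
    for dirpath, _dirs, files in os.walk(path):
        for name in files:
            if not name.endswith((".html", ".htm")):
                continue
            builder = TreeBuilder()
            full = os.path.join(dirpath, name)
            with open(full, encoding="utf-8", errors="replace") as fh:
                builder.feed(fh.read())
            builder.close()
            counts.update(n.signature() for n in builder.nodes
                          if n.tag == element)
    return counts
```

Calling count_patterns("../../scraped-site/", "div").most_common(10) would list the ten most frequent div structures with their counts. Signatures that occur more than once across pages are the candidates for components.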
