Using text recognition and unsupervised learning to cluster web bookmarks

chris-santiago/bookmarks_clustering


Files

| Filename | Contents | Container |
| --- | --- | --- |
| bookmarks.p | Pickled list of bookmark titles and URLs | Bookmark |
| bookmarks_data.json | List of bookmark titles, URLs and content | Dictionary |
| bookmarks_data.p | Pickled list of bookmark titles, URLs and content | Dictionary |
| bookmarks_df.p | Pickled dataframe of bookmark titles, URLs and content | DataFrame |
| websites.p | Pickled list of bookmark titles, URLs and content | Website |
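
As a rough illustration of working with bookmarks_data.json, the sketch below reads a JSON file and reports how much text each record carries. The exact layout of the file is not documented here, so the code assumes a list of dictionaries with `title`, `url` and `content` keys. The helper names `load_records` and `content_lengths` and the sample record are hypothetical, written only for this example.

```python
import json
import tempfile
from pathlib import Path


def load_records(path):
    """Read a JSON file assumed to hold a list of bookmark dictionaries."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def content_lengths(records):
    """Map each URL to the number of characters in its stored content."""
    return {r["url"]: len(r["content"]) for r in records}


# Illustrative stand-in data; the real file's contents are not shown here.
sample = [
    {
        "title": "Example Domain",
        "url": "https://example.com",
        "content": "Example page text",
    }
]

# Write the sample to a temporary file so the sketch runs without the real data.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "bookmarks_data.json"
    path.write_text(json.dumps(sample), encoding="utf-8")
    records = load_records(path)

lengths = content_lengths(records)
print(lengths)
```

If the real file uses different keys or nesting, only `content_lengths` would need adjusting.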

Collections

from collections import namedtuple

Bookmark = namedtuple('Bookmark', ['title', 'url'])
Website = namedtuple('Website', ['title', 'url', 'content'])
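
The two containers differ only by the `content` field, so one plausible way to relate them is to extend a `Bookmark` into a `Website` once the page text is available, then convert it to a plain dictionary for JSON or a DataFrame row. This is a minimal sketch and not necessarily how the repository builds its files. The helpers `with_content` and `to_record` and the sample values are hypothetical.

```python
from collections import namedtuple

Bookmark = namedtuple('Bookmark', ['title', 'url'])
Website = namedtuple('Website', ['title', 'url', 'content'])


def with_content(bookmark, content):
    """Build a Website from a Bookmark plus its page text (hypothetical helper)."""
    return Website(*bookmark, content)


def to_record(website):
    """Convert a Website to a plain dict keyed by field name."""
    return dict(website._asdict())


# Illustrative values only.
bm = Bookmark(title="Example Domain", url="https://example.com")
site = with_content(bm, "Example page text")
record = to_record(site)
print(record)
```

Because namedtuples are ordinary tuples, `Website(*bookmark, content)` simply unpacks the two existing fields and appends the third.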
