GitHub - billfitzgerald/trapper-keeper: Archive, organize, and watch for changes to publicly available information.

1. Overview

The Trapper Keeper is a collection of scripts that support archiving information from around the web to make it easier to study and use. If you are a researcher working with online material, an educator creating openly licensed content, or a curious person who likes to learn more about different subjects, then Trapper Keeper might be helpful to you. Trapper Keeper can currently archive and clean web pages and pdfs.

Trapper Keeper supports these features:

Archive data from multiple sources;
Clean data and save it as text;
List out embedded media and links;
Retain a copy of embedded images in the source text;
Track the source material for changes;
Organize your cleaned, archived data into arbitrary collections - a "collection" can be anything that unifies a set of information; ie, a set of urls that all relate to a specific topic; or a set of information that will be remixed into chapters;
Export a list of all tracked URLs.

2. Installation

Trapper Keeper has been tested on OSX and Ubuntu Linux. No testing has taken place on Windows machines.

3. General Use

The Trapper Keeper Overview page contains instructions on using Trapper Keeper.

4. Additional Details

The scripts in Trapper Keeper use csv files to help organize information. Sample csv files are included in the /samples directory.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
holepunch		holepunch
samples		samples
utilities		utilities
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
archive.py		archive.py
collect_texts.py		collect_texts.py
discovery.py		discovery.py
export.py		export.py
housekeeping.py		housekeeping.py
requirements.txt		requirements.txt
show_diffs.py		show_diffs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

1. Overview

2. Installation

3. General Use

4. Additional Details

About

Releases

Packages

Contributors 3

Languages

License

billfitzgerald/trapper-keeper

Folders and files

Latest commit

History

Repository files navigation

1. Overview

2. Installation

3. General Use

4. Additional Details

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages