Site Auditor


About

This tool is currently a work in progress. The aim is to create a front-end web app that developers can use to analyse the pages they are building for issues and potential performance enhancements.

Currently the app runs from the command line as python main.py (see the installation notes). It crawls the given URL to the given depth and generates a set of CSV reports in the reports/ directory.
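For illustration, here is a minimal sketch of what a depth-limited crawl of internal links can look like. This is not the tool's actual implementation (that lives in main.py), and the requests and beautifulsoup4 packages used here are assumptions, not necessarily the project's own dependencies.

```python
from urllib.parse import urljoin, urlparse

import requests
from bs4 import BeautifulSoup


def crawl(url, depth, seen=None):
    """Visit url, then follow same-host links up to `depth` levels deep."""
    seen = set() if seen is None else seen
    if url in seen:
        return seen
    seen.add(url)
    if depth <= 0:  # a depth of 0 crawls only the input URL, as with -d 0
        return seen
    html = requests.get(url, timeout=10).text
    for link in BeautifulSoup(html, "html.parser").find_all("a", href=True):
        target = urljoin(url, link["href"])
        # Only follow links on the same host (internal links).
        if urlparse(target).netloc == urlparse(url).netloc:
            crawl(target, depth - 1, seen)
    return seen
```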

Bugs and issues are tracked in the project's issues log.

Why

To audit websites, including some that aren't publicly accessible, for validation, performance, and SEO.

There are a few tools out there that do parts of what I am after, but they either cost money, are very slow, or don't do everything I need in one place, so I created another tool!

Installation

brew install tidy-html5

brew install phantomjs

brew install yarn

Set up a virtual environment (recommended), then:

pip install -r requirements.txt

yarn

npm install gulp

gulp

Configuration

cp settings/settings_example.py settings/settings.py

Then edit to provide valid database name, user, and password settings.
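The exact names in settings.py are defined by settings_example.py; the constants below are hypothetical placeholders that show the kind of values to fill in.

```python
# Hypothetical settings/settings.py; the real variable names come from
# settings/settings_example.py and may differ.
DATABASE_NAME = "site_auditor"
DATABASE_USER = "auditor"
DATABASE_PASSWORD = "change-me"
```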

Run

python main.py -u http://localhost:8000

Options

Example: resume a previous session, crawl to a depth of 3 links, and run performance reviews using YSlow:

python main.py -u http://localhost:8000 -d 3 -s 33a257d4-0664-11e7-9aa7-24a074f076f8 -p

-h, --help show the help message and exit

-u URL, --url URL The URL to start the crawl with. A depth of 0 (see -d) will crawl only the input URL

-d DEPTH, --depth DEPTH Depth of the search when following internal links

-s SESSION, --session SESSION Resume a previous session by adding the session key

-p, --performance Run performance tools (YSlow). Because the test is slow and resource intensive, this is best done after all other metrics are passing for an audit

-nr, --no-report Prevent the generation of CSVs in the reports directory. Ideal if you are using the web app

-nc, --no-crawl Prevent a crawl. Ideal for generating reports based on existing crawls
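As a sketch, the documented flags map naturally onto Python's argparse; the real parser in main.py may differ in defaults and help text.

```python
import argparse

# Assumed mapping of the documented flags onto argparse, for illustration.
parser = argparse.ArgumentParser(description="Site Auditor")
parser.add_argument("-u", "--url",
                    help="The URL to start the crawl with")
parser.add_argument("-d", "--depth", type=int, default=0,
                    help="Depth of the search when following internal links")
parser.add_argument("-s", "--session",
                    help="Resume a previous session by session key")
parser.add_argument("-p", "--performance", action="store_true",
                    help="Run performance tools (YSlow)")
parser.add_argument("-nr", "--no-report", action="store_true",
                    help="Skip CSV generation in the reports directory")
parser.add_argument("-nc", "--no-crawl", action="store_true",
                    help="Skip the crawl and only generate reports")

args = parser.parse_args()
```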

View Results

CSV

Unless CSV generation is disabled with -nr, reports will be generated in the reports/ directory
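For a quick look at the generated reports from Python, something like the following works; the reports/*.csv pattern follows the directory layout above, while the per-file summary printed here is just an example.

```python
import csv
from pathlib import Path

# Print a one-line summary for each generated report.
for path in Path("reports").glob("*.csv"):
    with path.open(newline="") as fh:
        rows = list(csv.reader(fh))
    print(path.name, f"{len(rows)} rows")
```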

Web app

Run python webapp.py and visit http://127.0.0.1:5000
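webapp.py ships with the repository; purely as a sketch of where that address comes from, a minimal Flask app serving on the same default host and port looks like this. The route and response body are assumptions, not the app's actual code.

```python
from flask import Flask

app = Flask(__name__)


@app.route("/")
def index():
    # Placeholder; the real app renders the audit results here.
    return "Audit results would be rendered here."


if __name__ == "__main__":
    # Flask's defaults: http://127.0.0.1:5000
    app.run(host="127.0.0.1", port=5000)
```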

Basic Roadmap

See the issues log for enhancements
