Parser

A minimal test of Haskell performance for the 1 billion rows challenge. This code doesn't try to achieve very high speed, but rather to investigate if a Haskell program can achieve decent execution performance straight out of the box.

Chris Penner's Beating C With 80 Lines Of Haskell: Wc was used as the inspiration for fast code, ie:

removing lazy evaluation when it's not necessary,
inlining functions, and
unboxing structures.

And you get icing on the cake by breaking down processing over long arrays on a per-CPU core basis.

The wc utility serves as the benchmark to see how the Haskell program is doing. Happy to report that when running on both MacOS M1 cpu and Linux AMD & Intel CPUs, the Haskell program is ~4-5x slower than wc. Importantly it runs in constant memory space. Given it's written without any special optimization tricks but for the 4 considerations mentioned before, using standard dictionary (Data.Map), text streaming (BasicString.Lazy), structure concatenation (Semigroup instancing) and multi-threading (forConcurrently).

The logic for the 1BRC is the 200 lines of src/Parsing/SimpleA.hs, the rest of the code in the repo is simply a generic application wrapper to manage configurations, CLI arguments, etc.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
app		app
src		src
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
Parser.cabal		Parser.cabal
README.md		README.md
Setup.hs		Setup.hs
makeTestData.hs		makeTestData.hs
package.yaml		package.yaml
stack.yaml		stack.yaml
weather_stations.csv		weather_stations.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Parser

About

Releases

Packages

Languages

License

hugodro/fudd-1brc

Folders and files

Latest commit

History

Repository files navigation

Parser

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages