Run length encoding

Run length encoding is an algorithm for performing lossless data compression. Lossless data compression refers to compressing the data in such a way that the original form of the data can then be derived from it. When a character occurs a large number of times consecutively in a sequence, then we can represent the same consecutive subsequence using only a single occurrence of that character and its count. Using run length encoding, we can save memory space while transmitting data and preserving its original form. It is useful when we want to store or transmit large sequences of data.

For example if the sequence is : AACCCBBBBBAAAAFFFFFFFFZ

Then, using run length encoding, we can represent it as: 2A3C5B4A8FZ

The 23 length sequence was compressed to a 11 length sequence.

The functions encode and decode in main.py use regex to parse the input strings in to the appropriate groups of characters and then output the respected encoded and decoded strings.

To run the tests install pytest pip install pytest and run pytest test.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
main.py		main.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Run length encoding

About

Releases

Packages

Languages

tesmond/run_length_encoding

Folders and files

Latest commit

History

Repository files navigation

Run length encoding

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages