The text I chose to compress:
A_tutor_who_tooted_the_flute_Tried_to_tutor_two_tooters_to_toot_Said_the_two_to_their_tutor,_“Is_it_harder_to_toot_Or_to_tutor_two_tooters_to_toot?”
My compression:
A_☄_who_☀ed☇flute_Tried☆★Said☇☃_☂_their_☄,_“Is_it_harder★Or☆_☂_☀?”
The dictionary:
☀ toot
☂ to
☃ two
☄ tu☂r
★ _☂_☀_
☆ _☂_☄_☃_☀ers
☇ _the_
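To check that a dictionary like this really reproduces the original text, a small expansion routine can help. The following is a minimal sketch, not the tool used for the activity: the function name `expand` is hypothetical, and the underscore padding in the ★ and ☇ entries is an assumption inferred from what the compressed text needs in order to decode correctly.

```python
# Dictionary as inferred from the compressed text (an assumption, see above).
# Entries may themselves contain symbols, e.g. the entry for ☄ uses ☂.
DICTIONARY = {
    "☀": "toot",
    "☂": "to",
    "☃": "two",
    "☄": "tu☂r",
    "★": "_☂_☀_",
    "☆": "_☂_☄_☃_☀ers",
    "☇": "_the_",
}


def expand(text, dictionary):
    """Replace symbols with their entries until no symbols remain.

    Assumes the dictionary has no circular entries; otherwise this loops forever.
    """
    while any(symbol in text for symbol in dictionary):
        for symbol, value in dictionary.items():
            text = text.replace(symbol, value)
    return text


compressed = "A_☄_who_☀ed☇flute_Tried☆★Said☇☃_☂_their_☄,_“Is_it_harder★Or☆_☂_☀?”"
original = expand(compressed, DICTIONARY)
```

Repeating the replacement pass is what lets nested entries such as ☆ unfold step by step into plain text.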
Compression Numbers:
Compressed text size: 66 bytes
Dictionary size: 41 bytes
Total: 107 bytes
Original text size: 148 bytes
Compression: 27.7%
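The arithmetic behind these numbers can be sketched as follows. The helper name `compression_stats` is hypothetical; the sketch assumes the compression percentage is the share of the original size that was saved once the dictionary is counted.

```python
def compression_stats(original_size, compressed_size, dictionary_size):
    """Return (total size, percent saved), counting the dictionary as part of the file."""
    total = compressed_size + dictionary_size
    saved = original_size - total
    percent = round(100 * saved / original_size, 1)
    return total, percent


total, percent = compression_stats(original_size=148,
                                   compressed_size=66,
                                   dictionary_size=41)
print(total, percent)  # 107 27.7
```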
Process:
- Looked for parts of words that repeated (e.g. “th” in “the”, “that”, “those”, etc.)
- Looked for phrases that repeated (used symbols already in dictionary, if possible)
- Looked for words that repeated (using symbols in dictionary, if possible)
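The search for repeats in the steps above was done by eye, but it could be approximated in code. This is a rough sketch only: the function `repeated_substrings` and its length limits are hypothetical choices, and it counts non-overlapping occurrences, which may differ from how a person would spot patterns.

```python
def repeated_substrings(text, min_len=2, max_len=8):
    """Map each substring of the given lengths to its count, keeping only repeats."""
    candidates = {
        text[i:i + n]
        for n in range(min_len, max_len + 1)
        for i in range(len(text) - n + 1)
    }
    # str.count counts non-overlapping occurrences
    counts = {s: text.count(s) for s in candidates}
    return {s: c for s, c in counts.items() if c >= 2}


text = ("A_tutor_who_tooted_the_flute_Tried_to_tutor_two_tooters_to_toot_"
        "Said_the_two_to_their_tutor,_“Is_it_harder_to_toot_"
        "Or_to_tutor_two_tooters_to_toot?”")
repeats = repeated_substrings(text)

# Show the most frequent repeats first, longer ones breaking ties
for pattern, count in sorted(repeats.items(),
                             key=lambda item: (-item[1], -len(item[0])))[:10]:
    print(repr(pattern), count)
```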
Challenges students may encounter include:
- Deciding when a substitution is worth making, since some substitutions add more to the dictionary than they save in the text and so lower the overall compression rate
- Finding the patterns and realizing that they can use existing symbols as part of the word or phrase they are compressing
I will encourage students to use trial and error and to keep checking the compression numbers to see which substitutions are worth making. I will also remind students to consider the dictionary size, since it is part of the file size!
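One way to make the "is it worth it?" question concrete is a quick net-savings estimate. This is a sketch under stated assumptions, not a rule from the activity: it assumes each symbol costs one byte and that a dictionary entry costs the pattern's length plus its symbol. The function name `net_savings` is hypothetical.

```python
def net_savings(pattern_length, occurrences, symbol_cost=1):
    """Bytes saved by replacing every occurrence of a pattern with one symbol.

    Each replacement saves (pattern_length - symbol_cost) bytes in the text,
    while the dictionary entry costs (pattern_length + symbol_cost) bytes.
    """
    saved_in_text = occurrences * (pattern_length - symbol_cost)
    dictionary_cost = pattern_length + symbol_cost
    return saved_in_text - dictionary_cost


print(net_savings(4, 6))  # 13: a 4-character pattern used six times pays off
print(net_savings(2, 2))  # -1: a 2-character pattern used twice makes the file bigger
```

Under these assumptions a short pattern has to repeat several times before it earns back its dictionary entry, which is the trade-off students meet when the compression rate drops.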