Drink Me: (Ab)Using a LLM to Compress Text
Introduction
Large language models are trained on huge datasets of text to learn the relationships and contexts of words within larger documents. These relationships are what allows the model to generate text.
Recently I've read concerns about LLMs being trained on copyrighted text and reproducing it. This got me thinking:
Read more here: External Link