Alex R ([personal profile] alexr_rwx) wrote 2010-06-22 01:56 am

May I present: kompressr

kompressr: make text shorter harnessing the power of acronyms (MTSHTPOA)

Try it out, and let me know what you think and whether it breaks :) It should be useful for making long papers shorter, by automatically extracting acronyms and using them wherever possible.

Public beta!™

(running on App Engine, with NLTK!)

[personal profile] chrisamaphone 2010-06-22 07:04 am (UTC)
cool! i tried it on my recent LJ posts to get an idea for what it did, and then learned a bunch about my own writing patterns. :) some of the acronyms were pretty awkward; at first i thought because they include common short words but maybe more because they don't fall on natural phrase boundaries. stuff like "dinner at", "talking about", "and the", "i had a"...
Edited 2010-06-22 07:05 (UTC)

[personal profile] lindseykuper 2010-06-22 05:22 pm (UTC)
"and then learned a bunch about my own writing patterns."

Unintended benefit! That's awesome.

[identity profile] oniugnip.livejournal.com 2010-06-23 01:31 am (UTC)
Thanks thanks!

Not breaking along phrase boundaries is both easier to implement and what I was going for (i.e., kind of senseless, but following a pattern that's intuitively clear).

The alternative would require some light parsing -- maybe chunking for noun phrases...
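
For anyone curious what "not breaking along phrase boundaries" could look like in practice, here is a minimal sketch. It is not kompressr's actual code, and the function names (`repeated_ngrams`, `acronym`, `compress`) and parameters are made up for illustration. It assumes whitespace tokenization and treats any word sequence of length `n` that repeats as fair game, which is roughly why fragments like "dinner at" end up with acronyms.

```python
from collections import Counter


def repeated_ngrams(tokens, n, min_count=2):
    """Return the n-grams (as tuples) that occur at least min_count times."""
    counts = Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))
    return [gram for gram, count in counts.items() if count >= min_count]


def acronym(gram):
    """Build an acronym from the first letter of each word in the n-gram."""
    return "".join(word[0].upper() for word in gram)


def compress(text, n=2, min_count=2):
    """Hypothetical sketch: spell out a repeated n-gram once, then abbreviate it.

    No parsing is done, so n-grams are chosen purely by repetition and may
    cut across phrase boundaries.
    """
    tokens = text.split()
    targets = set(repeated_ngrams([t.lower() for t in tokens], n, min_count))
    seen = set()
    out = []
    i = 0
    while i < len(tokens):
        gram = tuple(t.lower() for t in tokens[i:i + n])
        if len(gram) == n and gram in targets:
            if gram in seen:
                # Later occurrences are replaced by the acronym alone.
                out.append(acronym(gram))
            else:
                # First occurrence introduces the acronym in parentheses.
                out.append(" ".join(tokens[i:i + n]) + " (" + acronym(gram) + ")")
                seen.add(gram)
            i += n
        else:
            out.append(tokens[i])
            i += 1
    return " ".join(out)


print(compress("dinner at home and then dinner at work"))
# prints: dinner at (DA) home and then DA work
```

Restricting the candidates to noun-phrase chunks, as suggested above, would mean swapping `repeated_ngrams` for something that only proposes spans a chunker accepts; the replacement loop could stay the same.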