era's Bookmarks Tagged With "language.technology"
-
Joseph Rudman: Non-Traditional Authorship Attribution Studies in Eighteenth Century Literature
http://computerphilologie.uni-muenchen.de/jg02/rudman.html
-
Not rated yet.
- Details
"Non-traditional authorship attribution studies are those attribution studies that make use of the computer, statistics, and stylistics. The hypothesis behind these studies is that an author has a unique and identifiable style. The computer has no… More
-
-
Rudman 1999: The Hypothetical and Theoretical Underpinnings of Non-traditional Authorship Attribution Studies: Assumptions, Presumptions, and Verifiable Constructs
http://www.iath.virginia.edu/ach-allc.99/proceedings/rudman.html
-
Not rated yet.
- Details
HTML version of a 1999 (?) paper questioning the underpinnings of authorsip identification
-
-
Snowball
http://snowball.tartarus.org/
-
Not rated yet.
- Details
Snowball is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This site describes Snowball, and presents several useful stemmers which have been implemented using it.
-
-
non-responsive removed/redacted documents from previous FOIA Case 47415 (PDF)
http://www.governmentattic.org/docs/NSA_Nonresponsive_Docs_FOIA47415.pdf
-
Not rated yet.
- Details
Somebody approached the NSA to get them to reveal the missing parts of the "Acquaintance" n-gram classification patent, under the Freedom Of Information Act. PDF of scanned email correspondence between Marc Damashek and various interested parties.
-
-
Introduction to Tibetan Orthography || kuro5hin.org
http://www.kuro5hin.org/story/2004/2/5/01839/12103
-
Not rated yet.
- Details
"Here's a brief introduction to one of the world's most dysfunctional scripts." Commenters disagree and say it all makes sense ... sort of.
-
-
Fonts - OLPC
http://wiki.laptop.org/go/Fonts
-
Not rated yet.
- Details
Broad discussion of international fonts for the OLPC project.
-
-
ASP unicode to punycode idn online decoder and encoder.
http://www.motobit.com/util/punycode-decoder-encoder.asp
-
Not rated yet.
- Details
On-line encoding / decoding for your quick IDN (International Domain Names) and Punycode needs.
-
-
Reuters Corpora @ NIST
http://trec.nist.gov/data/reuters/reuters.html
-
Not rated yet.
- Details
In order to download Reuters corpora, need to sign a license agreement. RCV2 is multilingual in 13 languages.
-
-
Fun With Markov Chains
http://www.eblong.com/zarf/markov/
-
Not rated yet.
- Details
Yet another Markov chain / dissociated press / parody generator page. 'I am often asked about my message signature, which has been appearing at the bottom of email and Usenet postings for years now: "And Aholibamah bare Jeush, and Jaalam, and Kora… More
-
-
Spellify - An Automatic Text Field Spell Checker
http://www.spellify.com/
-
Not rated yet.
- Details
"A web based ajax spell checker that automatically checks the contents of a text field upon completion of typing." Their Finnish speller seems to be sorely lacking (no surprise there -- they should do their homework before offering a language with… More
-
Didn't find what you were looking for? Try searching Google.
Publish or subscribe to era's Bookmarks Tagged With "language.technology" via RSS and more...


