era's Bookmarks Tagged With "ir"
-
Snowball
http://snowball.tartarus.org/
-
Not rated yet.
- Details
Snowball is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This site describes Snowball, and presents several useful stemmers which have been implemented using it.
-
-
RCV1: A New Benchmark Collection for Text Categorization Research (Lewis, D. D.; Yang, Y.; Rose, T.; and Li, F.; Journal of Machine Learning Research, 2004)
http://www.ai.mit.edu/projects/jmlr/papers/volume5/lewis04a/lyrl2004_rcv1v2_README.htm
-
Not rated yet.
- Details
"Reuters Corpus Volume 1 (RCV1) (Rose, Stevenson and Whitehead, 2003) [...] consists of over 800,000 newswire stories that have been manually coded using three category sets. However, RCV1 as distributed [...] includes known errors in category ass… More
-
-
Research on N-Grams in Information Retrieval
http://www.cs.umbc.edu/ngram/
-
Not rated yet.
- Details
Fairly nice and comprehensive bibliography, including links to relevant patents and a few applications, except (a) it's mainly IR-oriented and (b) last upated in 1997 (ouch!)
-
-
US Patent 6,621,930: Automatic categorization of documents based on textual content (Smajda)
http://www.freepatentsonline.com/6621930.html
-
Not rated yet.
- Details
Patent by Frank Smajda "Haifa, US, IL" (-: I wonder if the patent could be contested for a fib like that)
-
Didn't find what you were looking for? Try searching Google.
Publish or subscribe to era's Bookmarks Tagged With "ir" via RSS and more...


