The CEAS 2008 Spam Filter Challenge (August 5th to August 8th, 2008) is a competition whose most interesting aspect is that it will be "live": filters will be tested in parallel on a real-world message stream, following TREC procedures. The CFP for the CEAS conference is closed.
In addition, the program and papers for the Fourth International Workshop on Adversarial Information Retrieval on the Web (AIRWEB 2008) are available. There are several "must-reads" for a content classification fan like me:
- Cleaning Search Results using Term Distance Features
  Josh Attenberg and Torsten Suel
- Exploring Linguistic Features for Web Spam Detection: A Preliminary Study
  Jakub Piskorski, Marcin Sydow and Dawid Weiss
- Latent Dirichlet Allocation in Web Spam Filtering
  Istvan Biro, Jacint Szabo and Andras Benczur
- Analysing Features of Japanese Splogs and Characteristics of Keywords
  Yuuki Sato, Takehito Utsuro, Tomohiro Fukuhara, Yasuhide Kawada, Yoshiaki Murakami, Hiroshi Nakagawa and Noriko Kando
- Webspam Identification Through Content and Hyperlinks
  Jacob Abernethy, Olivier Chapelle and Carlos Castillo
- The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems
  Beate Krause, Christoph Schmitz, Andreas Hotho and Gerd Stumme
And of course, the results of the Web Spam Challenge!