Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

141

TREC
2008

118views Information Technology» more TREC 2008»

IIT Kharagpur at TREC 2008 Blog Track

15 years 8 months ago

IIT Kharagpur at TREC 2008 Blog Track

Download trec.nist.gov

This paper describes our opinion retrieval system for TREC 2008 blog track. We focused on five different aspects of the system. The first module is focussed on extracting the blog content out from junk html and thereby decreasing the noise in the indexed content. The second module aims at removing various kind of spam content from real blogs. The third module aimed at retrieving the relevant documents. The fourth module filters out opinionated documents and the fifth one calculated the polarity of the sentiments in the document. The final ranked retrieval runs were based on various combination of settings in each module so as to study the effect of each. For classification of subjectivity and polarity, the predictions we done using a complementary naive bayes classifier

Robin Anil, Sudeshna Sarkar

Real-time Traffic

Final Ranked Retrieval | Information Technology | Module Filters | TREC 2008 | Trec 2008 Blog |

claim paper

Related Content

» TREC 2008 at the University at Buffalo Legal and Blog Track

» FUB IASICNR and University of Tor Vergata at TREC 2008 Blog Track

» Overview of the TREC 2007 Blog Track

» University of Lugano at TREC 2008 Blog Track

» FEUP at TREC 2008 Blog Track Using Temporal Evidence for Ranking and Feed Distillation

» IIT at TREC10

» KLE at TREC 2008 Blog Track Blog Post and Feed Retrieval

» York University at TREC 2008 Blog Track

» THUIR at TREC 2008 Blog Track

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2008
Where	TREC
Authors	Robin Anil, Sudeshna Sarkar

Comments (0)