Sciweavers

2827 search results - page 429 / 566
» Marking Text Documents
Sort
View
SIGIR
2008
ACM
13 years 10 months ago
Learning from labeled features using generalized expectation criteria
It is difficult to apply machine learning to new domains because often we lack labeled problem instances. In this paper, we provide a solution to this problem that leverages domai...
Gregory Druck, Gideon S. Mann, Andrew McCallum
SIGIR
2012
ACM
12 years 1 months ago
To index or not to index: time-space trade-offs in search engines with positional ranking functions
Positional ranking functions, widely used in web search engines, improve result quality by exploiting the positions of the query terms within documents. However, it is well known ...
Diego Arroyuelo, Senén González, Mau...
ICML
2007
IEEE
14 years 11 months ago
Self-taught learning: transfer learning from unlabeled data
We present a new machine learning framework called "self-taught learning" for using unlabeled data in supervised classification tasks. We do not assume that the unlabele...
Rajat Raina, Alexis Battle, Honglak Lee, Benjamin ...
WWW
2005
ACM
14 years 11 months ago
A multilingual usage consultation tool based on internet searching: more than a search engine, less than QA
We present a usage consultation tool, based on Internet searching, for language learners. When a user enters a string of words for which he wants to find usages, the system sends ...
Kumiko Tanaka-Ishii, Hiroshi Nakagawa
KDD
2006
ACM
141views Data Mining» more  KDD 2006»
14 years 11 months ago
Statistical entity-topic models
The primary purpose of news articles is to convey information about who, what, when and where. But learning and summarizing these relationships for collections of thousands to mil...
David Newman, Chaitanya Chemudugunta, Padhraic Smy...