Statistical topic models provide a general data-driven framework for automated discovery of high-level knowledge from large collections of text documents. While topic models can p...
Chaitanya Chemudugunta, Padhraic Smyth, Mark Steyv...
Abstract. Orange (www.ailab.si/orange) is a suite for machine learning and data mining. It can be used through scripting in Python or with visual programming in Orange Canvas using ...
Janez Demsar, Blaz Zupan, Gregor Leban, Tomaz Curk
This paper describes how a user modeling knowledge base for personalized TV servers can be generated starting from an analysis of lifestyle surveys. The aim of the research is the...
Measure words in Chinese are used to indicate the count of nouns. Conventional statistical machine translation (SMT) systems do not perform well on measure word generation due to ...
Dongdong Zhang, Mu Li, Nan Duan, Chi-Ho Li, Ming Z...
Reuse distance (i.e. LRU stack distance) precisely characterizes program locality and has been a basic tool for memory system research since the 1970s. However, the high cost of m...
Xipeng Shen, Jonathan Shaw, Brian Meeker, Chen Din...