Patent Classification Experiments with the Linguistic Classification System LCS

14 years 3 months ago

Download lands.let.ru.nl

In the context of the CLEF-IP 2010 classification task, we conducted a series of experiments with the Linguistic Classification System (LCS). We compared ment representations for patent abstracts: a bag-of-words representation and a syntactic/semantic representation containing both words and dependency triples. We evaluated two types of output: using a fixed cut-off on the ranking of the classes and using a flexible cut-off based on a threshold on the classification scores. Using the Winnow classifier, we obtained an improvement in classification scores when triples are added to the bag of words. However, our results are remarkably better on a held-out subset of the target data than on the 2 000-topic test set. The main findings of this paper are: (1) adding dependency triples to words has a positive effect on classification accuracy and (2) selecting classes by using a threshold on the classification scores instead of returning a fixed number of classes per document improves classific...

Suzan Verberne, Merijn Vogel, Eva D'hondt

Real-time Traffic

Classification Scores | CLEF 2010 | CLEF-IP 2010 Classification | Dependency Triples | Information Technology |

claim paper

Post Info
More Details (n/a)

Added	08 Nov 2010
Updated	08 Nov 2010
Type	Conference
Year	2010
Where	CLEF
Authors	Suzan Verberne, Merijn Vogel, Eva D'hondt

Comments (0)

Sciweavers

Patent Classification Experiments with the Linguistic Classification System LCS

Classification Scores | CLEF 2010 | CLEF-IP 2010 Classification | Dependency Triples | Information Technology |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers