Financial Forecasting Using Character N-Gram Analysis and Readability Scores of Annual Reports

15 years 5 months ago

Download users.cs.dal.ca

Abstract. Two novel Natural Language Processing (NLP) classiﬁcation techniques are applied to the analysis of corporate annual reports in the task of ﬁnancial forecasting. The hypothesis is that textual content of annual reports contain vital information for assessing the performance of the stock over the next year. The ﬁrst method is based on character n-gram proﬁles, which are generated for each annual report, and then labeled based on the CNG classiﬁcation. The second method draws on a more traditional approach, where readability scores are combined with performance inputs and then supplied to a support vector machine (SVM) for classiﬁcation. Both methods consistently outperformed a benchmark portfolio, and their combination proved to be even more eﬀective and eﬃcient as the combined models yielded the highest returns with the fewest trades. Key words: automatic ﬁnancial forecasting, n-grams, CNG, readability scores, support vector machines

Matthew Butler, Vlado Keselj

Real-time Traffic

AI 2009 | Annual Report | Artificial Intelligence | Support Vector Machines | ﬁnancial Forecasting |

claim paper

Post Info
More Details (n/a)

Added	02 Sep 2010
Updated	02 Sep 2010
Type	Conference
Year	2009
Where	AI
Authors	Matthew Butler, Vlado Keselj

Comments (0)

Sciweavers

Financial Forecasting Using Character N-Gram Analysis and Readability Scores of Annual Reports

AI 2009 | Annual Report | Artificial Intelligence | Support Vector Machines | ﬁnancial Forecasting |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers