Thesis icon

Thesis

Natural language processing for economic and financial modelling

Abstract:

Over the past years, researchers have introduced various natural language processing (NLP) methods to the fields of economics and finance. It is time to take stock and systematically evaluate which NLP methods work best for the most common domain specific tasks, under the most common domain specific data characteristics, and to let those findings guide future research. We underline the importance of evaluating multimodal (text and numeric) data tasks, since many text datasets in economics and finance come accompanied by potentially relevant numeric metadata. Particularly in the field of modern monetary economics, multimodal data analysis is of vital importance. Ever since the global financial crisis, central bank communication has become a key monetary policy tool. The development of multimodal data analysis frameworks is pivotal for modern day monetary policy research, yet the integration of NLP methods into economic and financial modelling processes is far from being conclusively answered.

In this DPhil thesis, we introduce work towards establishing an NLP benchmark foundation for economics and finance that includes multimodal dataset evaluation tasks. Next, we introduce Bayesian Topic Regression (BTR). BTR is a multimodal NLP algorithm based on a supervised topic model that can jointly model text and numeric data. With BTR, we provide the research community with a method for more reliable causal inference with text data, in an identification framework commonly used in the economics and finance literature. BTR also demonstrates competitiveness in prediction tasks that rely on text and numeric data. Finally, we introduce a multimodal NLP framework for the application space of monetary policy analysis. We establish a new monetary policy shock series based on central bank speeches along three key macroeconomic measures: GDP growth, inflation, and unemployment. Based on empirical estimates on our newly constructed central bank communication dataset, our monetary news signals imply that news on macroeconomic outlooks in central bank communication can help explain equity and bond market volatility and tail risk. The news signals carry relevant information to which markets attend and react. We also derive a monetary news dispersion index, measuring the degree of alignment amongst different central bankers in their policy communication with the markets. Our findings suggests that more misaligned policy communication is associated with stronger market surprises at policy announcement time.

Actions

Access Document

Files:

Authors

More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author

Contributors

Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Supervisor
Institution:
University of Oxford
Role:
Supervisor
ORCID:
0000-0002-9003-6642


More from this funder
Funder identifier:
https://ror.org/03n0ht308
Programme:
ESRC Advanced Quantitative Methods Scholarship


DOI:
Type of award:
DPhil
Level of award:
Doctoral
Awarding institution:
University of Oxford

Terms of use


Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP