Conference item icon

Conference item

In-database learning with sparse tensors

Abstract:
In-database analytics is of great practical importance as it avoids the costly repeated loop data scientists have to deal with on a daily basis: select features, export the data, convert data format, train models using an external tool, reimport the parameters. It is also a fertile ground of theoretically fundamental and challenging problems at the intersection of relational and statistical data models. This paper introduces a unified framework for training and evaluating a class of statistical learning models inside a relational database. This class includes ridge linear regression, polynomial regression, factorization machines, and principal component analysis. We show that, by synergizing key tools from relational database theory such as schema information, query structure, recent advances in query evaluation algorithms, and from linear algebra such as various tensor and matrix operations, one can formulate in-database learning problems and design efficient algorithms to solve them. The algorithms and models proposed in the paper have already been implemented inside the LogicBlox database engine and used in retail-planning and forecasting applications, with significant performance benefits over out-of-database solutions that require the costly data-export loop.
Publication status:
Published
Peer review status:
Peer reviewed

Actions


Access Document


Files:
Publisher copy:
10.1145/3196959.3196960

Authors


More by this author
Institution:
University of Oxford
Oxford college:
St Cross College
Role:
Author
More by this author
Institution:
University of Oxford
Division:
MPLS
Department:
Computer Science
Role:
Author


Publisher:
Association for Computing Machinery
Host title:
SIGMOD/PODS '18 Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems
Journal:
ACM Principles of Database Systems More from this journal
Pages:
325-340
Publication date:
2018-05-27
Acceptance date:
2017-09-01
Event location:
Houston
DOI:
ISBN:
9781450347068


Keywords:
Pubs id:
pubs:725630
UUID:
uuid:2d852e0d-889d-46fe-890e-b1ac5687c798
Local pid:
pubs:725630
Source identifiers:
725630
Deposit date:
2017-09-05

Terms of use



Views and Downloads






If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP