Abstract

Large-scale machine learning requires tradeoffs. Commonly, practitioners have chosen simpler, less powerful models, e.g. linear models, in order to process more training examples in a limited time. In this work, we introduce parallelism to the training of non-linear models by leveraging a different tradeoff: approximation. We demonstrate various techniques by which non-linear models can be made amenable to larger data sets and significantly more training parallelism by strategically introducing approximation in certain optimization steps.

For gradient boosted regression tree ensembles, we replace precise selection of tree splits with a coarse-grained, approximate split selection, yielding both faster sequential training and a significant increase in parallelism, particularly in the distributed setting. For metric learning with nearest neighbor classification, rather than explicitly training a neighborhood structure, we leverage the implicit neighborhood structure induced by task-specific random forest classifiers, yielding a highly parallel method for metric learning. For support vector machines, we follow existing work to learn a reduced basis set with extremely high parallelism, particularly on GPUs, via existing linear algebra libraries.
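To illustrate the first of these ideas, the following is a minimal sketch of coarse-grained split selection for one feature of a regression tree. It is not taken from the dissertation: the function names (bin_histogram, best_split_from_histogram), the fixed bin edges, and the squared-error gain are all assumptions chosen for illustration. Each worker summarizes its shard of the data as a small histogram of counts and residual sums, the histograms are added together, and only the bin boundaries are considered as candidate thresholds.

```python
import numpy as np

def bin_histogram(x, r, edges):
    """Per-bin example counts and residual sums for one data shard.

    Bin i holds values with edges[i-1] < x <= edges[i] (hypothetical layout).
    """
    idx = np.searchsorted(edges, x, side="left")
    n_bins = len(edges) + 1
    counts = np.bincount(idx, minlength=n_bins).astype(float)
    sums = np.bincount(idx, weights=r, minlength=n_bins)
    return counts, sums

def best_split_from_histogram(counts, sums, edges):
    """Pick the bin edge that maximizes the squared-error reduction."""
    # Prefix sums give the left-child statistics for each threshold edges[i].
    left_n = np.cumsum(counts)[:-1]
    left_s = np.cumsum(sums)[:-1]
    total_n, total_s = counts.sum(), sums.sum()
    right_n, right_s = total_n - left_n, total_s - left_s

    # Only thresholds that leave both children non-empty are valid.
    valid = (left_n > 0) & (right_n > 0)
    gain = np.full(len(edges), -np.inf)
    gain[valid] = (left_s[valid] ** 2 / left_n[valid]
                   + right_s[valid] ** 2 / right_n[valid]
                   - total_s ** 2 / total_n)
    best = int(np.argmax(gain))
    return float(edges[best]), float(gain[best])

# Toy data: residuals change sign between x = 3 and x = 4.
x = np.arange(8, dtype=float)
r = np.array([-1.0, -1.0, -1.0, -1.0, 1.0, 1.0, 1.0, 1.0])
edges = np.array([1.5, 3.5, 5.5])  # assumed, fixed candidate thresholds

# Two "workers" each summarize their own shard; histograms simply add.
c1, s1 = bin_histogram(x[:5], r[:5], edges)
c2, s2 = bin_histogram(x[5:], r[5:], edges)
counts, sums = c1 + c2, s1 + s2

threshold, gain = best_split_from_histogram(counts, sums, edges)
```

Because the histograms are additive, the amount of data exchanged between workers depends on the number of bins rather than the number of training examples, which is one plausible reading of why a coarse split search parallelizes well. The price is that the chosen threshold can only fall on a bin edge.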

We believe these optimization tradeoffs are widely applicable wherever machine learning is put into practice in large-scale settings. By carefully introducing approximation, we also introduce significantly higher parallelism and consequently can process more training examples for more iterations than competing exact methods. Although the model is seemingly learned with less precision, this tradeoff often yields noticeably higher accuracy under a restricted training time budget.

Committee Chair

Kunal Agrawal

Committee Members

Roger Chamberlain, Robert Pless

Comments

Permanent URL: https://doi.org/10.7936/K70863FB

Degree

Doctor of Philosophy (PhD)

Author's Department

Computer Science & Engineering

Author's School

McKelvey School of Engineering

Document Type

Dissertation

Date of Award

Winter 12-15-2014

Language

English (en)

DOI

https://doi.org/10.7936/K70863FB

Recommended Citation

Tyree, Stephen, "Approximation and Relaxation Approaches for Parallel and Distributed Machine Learning" (2014). McKelvey School of Engineering Theses & Dissertations. 64.

The definitive version is available at https://doi.org/10.7936/K70863FB

Included in

Engineering Commons
