|
|
 |
Supervised Learning
Classification, regression, and relatives comprise the most widely-used type of statistical task.
fastlab
Big Ideas
People
Stuff
|
 |
|
 |
Isometric Separation Maps
Nikolaos Vasiloglou, Alexander Gray, and David Anderson
Machine Learning and Signal Processing (MLSP) 2009
An approach to learning the kernel for support vector machines, which can guarantee linear separability in the kernel space.
[pdf]
Abstract:
Maximum Variance Unfolding (MVU) and its variants have
been very successful in embedding data-manifolds in lower
dimensional spaces, often revealing the true intrinsic dimension.
In this paper we show how to also incorporate supervised
class information into an MVU-like method without
breaking its convexity. We call this method the Isometric
Separation Map and we show that the resulting kernel
matrix can be used as a binary/multiclass Support Vector
Machine-like method in a semi-supervised (transductive)
framework. We also show that the method always finds a
kernel matrix that linearly separates the training data exactly
without projecting them in infinite dimensional spaces. In
traditional SVMs we choose a kernel and hope that the data
become linearly separable in the kernel space. In this paper
we show how the hyperplane can be chosen ad hoc and the
kernel is trained so that data are always linearly separable.
Comparisons with Large Margin SVMs show comparable performance.
@Inproceedings{vasiloglou2009ism,
Author = "Nikolaos Vasiloglou and Alexander G. Gray and David Anderson",
Title = "{Learning Isometric Separation Maps}",
Booktitle = "IEEE International Workshop on Machine Learning For Signal Processing (MLSP)",
Year = "2009"
}
|
New Algorithms for Efficient High-Dimensional Nonparametric Classification
Ting Liu, Andrew W. Moore, Alexander G. Gray
Journal of Machine Learning Research (JMLR), 2006
Fast exact algorithms for nearest-neighbor classification (and related problems), exploiting the fact that solving this does not strictly require finding the nearest neighbors and demonstrating speedups in higher dimensionalities than typical for nearest neighbor search.
[pdf]
Abstract:
This paper is about non-approximate acceleration of high-dimensional nonparametric operations such as k nearest neighbor classifiers. We attempt to exploit the fact that even if we want exact answers to nonparametric queries, we usually do not need to explicitly find the data points close to the query, but merely need to answer questions about the properties of that set of data points. This offers a small amount of computational leeway, and we investigate how much that leeway can be exploited. This is applicable to many algorithms in nonparametric statistics, memory-based learning and kernel-based learning. But for clarity, this paper concentrates on pure k-NN classification. We introduce new ball-tree algorithms that on real-world data sets give accelerations from 2-fold to 100-fold compared against highly optimized traditional ball-tree-based k-NN. These results include data sets with up to 10^6 dimensions and 10^5 records, and demonstrate non-trivial speed-ups while giving exact answers.
@article{liu2006nnclsf,
Author = "Ting Liu and Andrew W. Moore and Alexander G. Gray",
Title = "{New Algorithms for Efficient High Dimensional Nonparametric
Classification}",
journal = "Journal of Machine Learning Research (JMLR)",
volume = "7",
pages = "1135--1158",
year = "2006"
}
|
|
|
 |
|
|
Optimization of Kernel Machines
We are developing efficient optimization methods for regularized kernel machines, including support vector machines.
[see webpage here]
|
|
|
 |
|
|
Feature-Selecting Support Vector Machines
We are exploring efficient methods for learning L0 and L1 sparse support vector machines.
|
|
|
|