Learning to learn with the informative vector machine
Large margin hierarchical classification
Feature selection, $l_1$ vs. $l_2$ regularization, and rotational invariance
Multiple kernel learning, conic duality, and the SMO algorithm
Locally linear metric adaptation for semi-supervised clustering
Online and batch learning of pseudo-metrics