ECONOMETRICS; Scalable and hyper-parameter-free non-parametric covariate shift adaptation with conditional sampling (Professor Lionel Truquet)

Abstract

Covariate shift has become a prominent problem setting in machine learning and related fields. It concerns many situations for which a model trained on a source sample has to be applied to a target sample with covariates having a different probability distribution. Many existing covariate shift adaptation methods estimate sample weights to be used in the risk estimation in order to mitigate the gap between the source and the target distribution. However, non-parametrically estimating the optimal weights typically involves computationally expensive hyper-parameter tuning that is crucial to the final performance. In this paper, we propose a new non-parametric approach to covariate shift adaptation which avoids estimating weights and has no hyper-parameter to be tuned. Our basic idea is to label unlabeled target data according to the k-nearest neighbors in the source dataset. Our analysis indicates that setting k=1 is an optimal choice. Thanks to this property, there is no need to tune any hyper-parameters, unlike other non-parametric methods. Moreover, our method achieves a running time quasi-linear in the sample size with a theoretical guarantee, for the first time in the literature to the best of our knowledge. Our results include sharp rates of convergence for estimating the joint probability distribution of the target data. In particular, the variance of our estimators has the same rate of convergence as for standard parametric estimation despite their non-parametric nature. Our numerical experiments show that proposed method brings drastic reduction in the running time with accuracy comparable to that of the state-of-the-art methods.

Click here to view paper.

Date
Monday, 25 March 2024

Time
4pm to 5pm

Venue
Lim Tay Boh Seminar Room; AS02 03-12
Scroll to Top