## Discrepancy and $\epsilon$-Approximations for Bounded VC-Dimensions


Emo Welzl (joint work with Jiri Matousek, Lorenz Wernisch)
Freie Universität Berlin
Institut für Informatik
Email: welzl@inf.fu-berlin.de
Report B 91-06
April 1991

Get the report here or by anonymous ftp:
Server: fubinf.inf.fu-berlin.de
File:   pub/reports/tr-b-91-06.ps.gz

Let $(X,\RR)$ be a set system on an $n$-point set $X$. We investigate colorings $\chi:X\to\{-1,+1\}$, such that the sum of colors in each set $R \in \RR$ does not deviate too much from $0$. The largest value $| \sum_{x \in R} \chi(x)|$ over all $R \in \RR$ is the {\em discrepancy} of $\chi$ on ${\cal R}$. We also consider {$\eps$-ap\-prox\-i\-ma\-tions}: For $0 < \eps < 1$, a subset $A$ of $X$ is an {\em $\eps$-ap\-prox\-i\-ma\-tion} for $(X,\RR)$, if $|\, |A \cap R| / |A| - |R| / n \,| < \eps$ for all $R \in \RR$. Let $d$ be a fixed integer such that for any $Y\subseteq X$, the number of distinct sets of the form $R\cap Y$, $R\in\RR$ is bounded by $O(|Y|^d)$ (i.e., $(X,\RR)$ is an $n$-point range space with the {\em primal shatter function} $\pi_{\cal R}(m)$ bounded by $O(m^d)$). Then we prove that there is always a coloring with discrepancy bounded by $O(\pdiscub)$. We show that this implies that, for any $r$, there exists a {\em $(1/r)$-ap\-prox\-i\-ma\-tion} for $(X,{\cal R})$ of size $O(\paproub)$. This improves on a previous bound of $O(r^2\log r)$ due to Vapnik and Chervonenkis. If any subcollection of $m$ sets of $\RR$ partitions the points into at most $O(m^d)$ classes (i.e., the {\em dual shatter function} is bounded by $O(m^d)$), then we get a bound of $O(\ddiscub)$ for discrepancy and of $O(\daproub)$ for $(1/r)$-ap\-prox\-i\-ma\-tions. These bounds via the dual shatter function can be realized by deterministic polynomial time algorithms. All the bounds are tight upto polylogarithmic factors in the worst case. Our results allow to generalize several results of Beck bounding the discrepancy in certain geometric settings to the case when the discrepancy is taken relative to an arbitrary measure.