Reducing the False Positive Rate for Correlated Queries with the Adaptive Cuckoo Filter (ACF)

February 13, 2018

Pedro Reviriego


Time:   10:45am
Location:   Meeting room 302 (Mountain View), level 3

In this talk we will present the adaptive cuckoo filter (ACF), a data structure for approximate set membership that extends the cuckoo filter by reacting to false positives so that they are removed for future queries. As an example application, in packet processing, queries may correspond to flow identifiers, so a search for an element is likely to be followed by repeated searches for that same element. Removing a false positive once it has been detected can therefore significantly lower the overall false positive rate. The ACF, like the cuckoo filter, uses a cuckoo hash table to store fingerprints. We allow fingerprint entries to be changed in response to a false positive, in a manner designed to minimize the effect on the performance of the filter. We will show that the ACF substantially reduces the false positive rate by presenting both a theoretical model for the false positive rate and simulations using synthetic data sets and real packet traces.
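
To make the idea of changing a fingerprint after a false positive more concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the design presented in the talk: it assumes the full keys are kept in a companion table so that a false positive can be detected, and that each cell carries a small selector choosing which hash function produced its fingerprint. The class name AdaptiveCuckooSketch, the helper _h, and all parameters are hypothetical.

```python
import hashlib


def _h(key, salt, mod):
    """Deterministic hash of (salt, key), reduced modulo mod."""
    data = f"{salt}:{key}".encode()
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big") % mod


class AdaptiveCuckooSketch:
    """Two-table cuckoo hash with one cell per bucket (illustrative only)."""

    def __init__(self, n_buckets=64, fp_bits=8, n_selectors=4, max_kicks=200):
        self.n = n_buckets
        self.fp_mod = 1 << fp_bits
        self.n_sel = n_selectors
        self.max_kicks = max_kicks
        # Assumption: full keys live in a companion table mirroring the filter.
        self.keys = [[None] * n_buckets for _ in range(2)]
        self.sel = [[0] * n_buckets for _ in range(2)]     # per-cell selector
        self.fps = [[None] * n_buckets for _ in range(2)]  # stored fingerprints

    def _bucket(self, key, table):
        return _h(key, f"b{table}", self.n)

    def _fp(self, key, selector):
        return _h(key, f"f{selector}", self.fp_mod)

    def _place(self, key, t, b):
        self.keys[t][b] = key
        self.sel[t][b] = 0
        self.fps[t][b] = self._fp(key, 0)

    def insert(self, key):
        """Cuckoo insertion with kicks; duplicate keys are not handled."""
        t = 0
        for _ in range(self.max_kicks):
            b = self._bucket(key, t)
            victim = self.keys[t][b]
            self._place(key, t, b)
            if victim is None:
                return
            key, t = victim, 1 - t  # re-insert the evicted key in the other table
        raise RuntimeError("table too full; the last evicted key was not stored")

    def contains(self, key):
        """Filter-only query: may return a false positive, never a false negative."""
        for t in range(2):
            b = self._bucket(key, t)
            if self.fps[t][b] is not None and self.fps[t][b] == self._fp(key, self.sel[t][b]):
                return True
        return False

    def lookup(self, key):
        """Exact lookup that adapts every cell responsible for a false positive."""
        found = False
        for t in range(2):
            b = self._bucket(key, t)
            stored = self.keys[t][b]
            if stored is None or self.fps[t][b] != self._fp(key, self.sel[t][b]):
                continue  # the filter reports no match in this cell
            if stored == key:
                found = True
            else:
                # False positive: switch to the next fingerprint function and
                # recompute the fingerprint from the stored key.
                self.sel[t][b] = (self.sel[t][b] + 1) % self.n_sel
                self.fps[t][b] = self._fp(stored, self.sel[t][b])
        return found
```

In this sketch, a repeated query for the same non-member key after lookup has adapted the cell will usually miss, because the stored key and the queried key rarely collide under the new fingerprint function as well. Stored keys are never lost by adaptation, since each fingerprint is always recomputed from the key held in the same cell.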