DMTCS Proceedings, 2005 International Conference on Analysis of Algorithms

Font Size:  Small  Medium  Large
DMTCS Conference vol AD (2005), pp. 157-166

DMTCS

2005 International Conference on Analysis of Algorithms

Conrado Martínez (ed.)

DMTCS Conference Volume AD (2005), pp. 157-166


author: Frédéric Giroire
title: Order statistics and estimating cardinalities of massive data sets
keywords: cardinality, estimates, very large multiset, traffic analysis
abstract: We introduce a new class of algorithms to estimate the cardinality of very large multisets using constant memory and doing only one pass on the data. It is based on order statistics rather that on bit patterns in binary representations of numbers. We analyse three families of estimators. They attain a standard error of
1/√
M
using
M
units of storage, which places them in the same class as the best known algorithms so far. They have a very simple internal loop, which gives them an advantage in term of processing speed. The algorithms are validated on internet traffic traces.
  If your browser does not display the abstract correctly (because of the different mathematical symbols) you may look it up in the PostScript or PDF files.
reference: Frédéric Giroire (2005), Order statistics and estimating cardinalities of massive data sets, in 2005 International Conference on Analysis of Algorithms, Conrado Martínez (ed.), Discrete Mathematics and Theoretical Computer Science Proceedings AD, pp. 157-166
bibtex: For a corresponding BibTeX entry, please consider our BibTeX-file.
ps.gz-source: dmAD0115.ps.gz (98 K)
ps-source: dmAD0115.ps (242 K)
pdf-source: dmAD0115.pdf (136 K)

The first source gives you the `gzipped' PostScript, the second the plain PostScript and the third the format for the Adobe accrobat reader. Depending on the installation of your web browser, at least one of these should (after some amount of time) pop up a window for you that shows the full article. If this is not the case, you should contact your system administrator to install your browser correctly.

Due to limitations of your local software, the two formats may show up differently on your screen. If eg you use xpdf to visualize pdf, some of the graphics in the file may not come across. On the other hand, pdf has a capacity of giving links to sections, bibliography and external references that will not appear with PostScript.


Automatically produced on Di Sep 27 10:09:31 CEST 2005 by gustedt

Valid XHTML 1.0 Transitional