How can users who know neither programming nor statistics explore large databases? We present a novel interface, designed to guide explorers through their data: Blaeu. Blaeu is a database front-end, “boosted” with unsupervised learning primitives. Thanks to these primitives, it can summarize and recommend queries. Our first contribution is Blaeu’s interaction model. With Blaeu, users explore the data through data maps. A data map is an interactive set of clusters, which users navigate with zooms and projections. Our second contribution is Blaeu’s engine. We present three mapping algorithms, for three different settings. The first algorithm deals with small to medium databases, the second one targets high dimensional spaces, and the last one focuses on speed and interaction. We then present an optimization strategy based on sampling. Our experiments reveal that Blaeu can cluster millions of tuples with hundreds of columns in a few seconds on commodity hardware.

 Cluster-Driven Navigation of the Query Space

System Configuration:

 

 

 

HARDWARE REQUIREMENTS:

 

Hardware                             –     Pentium

Speed                                   –     1.1 GHz

RAM                                   –    1GB

Hard Disk                           –    20 GB

Key Board                          –    Standard Windows Keyboard

Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space  Cluster-Driven Navigation of the Query Space