System Overview
- Best Reference: MacroBase: Prioritizing Attention in Fast Data. Peter Bailis, Edward Gan, Samuel Madden, Deepak Narayanan, Kexin Rong, Sahaana Suri. SIGMOD 2017. Selected as "Best of SIGMOD 2017".
Design Principles
- Prioritizing Attention in Fast Data: Principles and Promise. Peter Bailis, Edward Gan, Kexin Rong, Sahaana Suri. CIDR 2017.
Unsupervised Classification
- Scalable Kernel Density Classification via Threshold-Based Pruning. Edward Gan, Peter Bailis. SIGMOD 2017.
Explanation and Visualization
- ASAP: Prioritizing Attention via Time Series Smoothing. Kexin Rong, Peter Bailis. VLDB 2017.
- Finding Heavily-Weighted Features in Data Streams. Kai Sheng Tai, Vatsal Sharan, Peter Bailis, Gregory Valiant. arXiv:1711.02305.
- There and Back Again: A General Approach to Learning Sparse Models. Vatsal Sharan, Kai Sheng Tai, Peter Bailis, Gregory Valiant. arXiv:1706.08146.
Domain-Specific Data Transformation
- NoScope: Optimizing Neural Network Queries over Video at Scale. Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, Matei Zaharia. VLDB 2017.
- DROP: Dimensionality Reduction Optimization for Time Series. Sahaana Suri, Peter Bailis. arXiv:1708.00183.
Applications
- Demonstration: MacroBase, a Fast Data Analysis Engine. Peter Bailis, Edward Gan, Kexin Rong, Sahaana Suri. SIGMOD 2017.
- Efficient blind search for small similar-waveform earthquakes in a decade of continuous seismic data (2007-2017) in coastal central California. Clara Yoon, Karianne Bergen, Kexin Rong, Hashem Elezabi, Peter Bailis, Philip Levis, Gregory C. Beroza. SCEC Annual Meeting 2017.