Talks & Publications
Here is an overview of my talks, podcasts, scientific papers, external articles, and other published content.
External Articles
- The Future of SQL: Databases Meet Stream Processing, Confluent, Nov 2021
- Digitalisation World Q&A on Kafka, Interview with Digitalisation World, Nov 2020
- Streams and Tables in Apache Kafka: Elasticity, Fault Tolerance, and other Advanced Concepts, Confluent, Jan 2020
- Streams and Tables in Apache Kafka: Processing Fundamentals with Kafka Streams and ksqlDB, Confluent, Jan 2020
- Streams and Tables in Apache Kafka: Topics, Partitions, and Storage Fundamentals, Confluent, Jan 2020
- Streams and Tables in Apache Kafka: A Primer, Confluent, Jan 2020
- Democratizing Stream Processing with Apache Kafka and KSQL, InfoQ, Jun 2018
- Big, fast, easy data with KSQL, O’Reilly Media, Feb 2018
- Kafka: Watermarks, Tables, Event Time, and the Dataflow Model, Confluent, May 2017
- Secure Stream Processing with the Streams API in Kafka, Confluent, Jul 2016
- Elastic Scaling in the Streams API in Kafka, Confluent, Jul 2016
- Distributed, Real-time Joins and Aggregations on User Activity Events using Kafka Streams, Confluent, Jun 2016
Podcasts
- Event Streaming Trends and Predictions for 2021 ft. Gwen Shapira, Ben Stopford, and Michael Noll, Confluent’s Streaming Audio Podcast, Jan 2021
- Apache Kafka mit Michael Noll, Programmier.bar Podcast (German), Dec 2020
- 5 Years of Event Streaming and Counting ft. Gwen Shapira, Ben Stopford, and Michael Noll, Confluent’s Streaming Audio Podcast, Aug 2020
- Apache Kafka Fundamentals: The Concept of Streams and Tables ft. Michael Noll, Confluent’s Streaming Audio Podcast, May 2020
Talks & Presentations
- Apache Kafka and the Data Mesh, Kafka Summit Americas, Sep 2021
- Apache Kafka and the Data Mesh, Kafka Summit Europe, May 2021
- Databases Are Only Half-Done, Keynote at W-JAX Germany, Nov 2020
- Tradeoffs in Distributed Systems Design: Is Kafka The Best?, Kafka Summit Austin, Aug 2020
- Keynote: The Database is Only Half-Done, Confluent Streaming Event, Vienna, Austria, Jun 2020
- Now You See Me, Now You Compute: Building event-driven architectures with Apache Kafka, Strata Data Conference, New York, USA, Sep 2019
- Kafka 102: Streams and Tables All the Way Down, Kafka Summit, San Francisco, USA, Oktober 2019
- Stream Processing with Apache Kafka, Kafka Meetup Zurich, Switzerland, Apr 2019
- Apache Kafka in Theory and Practive, guest lecture at Mining Streaming Data course, Hasso Plattner Institute, Potsdam/Berlin, Germany, Apr 2019
- An Introduction to the event streaming platform Apache Kafka, guest lecture at Data Stream Processing and Analytics course at ETH Zurich, Switzerland, Mar 2019
- Big, Fast, Easy Data: distributed stream processing for everyone with KSQL, the streaming SQL engine for Apache Kafka, Berlin Buzzwords, Berlin, Germany, Jun 2018
- Unlocking the world of stream processing with KSQL, the streaming SQL engine for Apache Kafka (slides), Strata Data Conference Europe, London, UK, May 2018
- Rethinking Stream processing with Apache Kafka, Google DevFest Switzerland, Oct 2017
- Rethinking Stream processing with Apache Kafka, Zurich Apache Kafka Meetup, Switzerland, Sep 2017
- Stream Processing with Apache Kafka, Dublin Apache Kafka Meetup, Ireland, Jul 2017
- Rethinking Stream Processing with Apache Kafka: Applications vs. Clusters, Streams vs. Databases, Berlin Buzzwords, Germany Jun 2017
- Stream Processing with Apache Kafka’s Streams API, Apache Kafka meetup Munich, Germany, Jun 2017
- Rethinking Stream processing with Apache Kafka: Applications vs. Clusters, Streams vs. Databases, Strata Data Conference Europe, London, UK, May 2017
- The Best Thing Since Partitioned Bread: Rethinking Stream Processing with Apache Kafka’s new Streams API, Kafka Summit, New York, May 2017
- Kafka’s Streams API: An Overview, Apache Kafka meetup Munich, Germany, January 2017
- Introducing Kafka Streams, the new stream processing library of Apache Kafka, Berlin Buzzwords, Germany, Jun 2016
- Being Ready for Apache Kafka: Today’s Ecosystem and Future Roadmap, ApacheCon: Big Data, Hungary, Sep 2015
- Practical Pig and Pig Unit, Talk at Swiss Big Data User Group, ETH Zurich, Jul 2012
- Telling Experts from Spammers: Expertise Ranking in Folksonomies, Talk at Decentralized Information Group, CSAIL, MIT, Boston, USA, Jul 2009
- Telling Experts from Spammers: Expertise Ranking in Folksonomies, Presentation at the 32nd Annual ACM SIGIR Conference, Boston, USA, Jul 2009
- On Measuring Expertise in Collaborative Tagging Systems, Presentation at the Web Science Conference ‘09 (WebSci), Athens, Greece, March 2009
- The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries, Presentation at the 7th IEEE/WIC/ACM International Conference on Web Intelligence (WI), Sydney, Australia, December 2008
- Building a Scalable Collaborative Web Filter with Free and Open Source Software, Presentation at the 4th IEEE International Conference Signal-Image Technology & Internet-based Systems (SITIS), Bali, Indonesia, November 2008
- Web Search Personalization via Social Bookmarking and Tagging, Presentation at 6th International Semantic Web Conference (ISWC) & 2nd Asian Semantic Web Conference (ASWC), Busan, South Korea, November 2007
- GooDiff: An Online Consumer Service for Monitoring Changes in Legal Documents, BarCamp talk, HACK conference, Luxembourg, Oct 2007
- Authors vs. Readers: A Comparative Study of Document Metadata and Content in the WWW, Presentation at 7th International ACM Symposium on Document Engineering (DocEng), Winnipeg, Canada, August 2007
- Design and Anatomy of a Social Web Filtering Service, Presentation at CIC conference, Hong Kong, Oct 2006
- An Exploratory Study of Internet Content Rating Systems, Presentation at HACK conference, Luxembourg, Oct 2005
Academic Papers & Publications
- SPEAR: Spamming-resistant Expertise Analysis and Ranking in Collaborative Tagging Systems (ask me for a free
copy)
C.-M. Au Yeung, M. G. Noll, N. Gibbins, C. Meinel, N. Shadbolt
International Journal of Computational Intelligence, Wiley-Blackwell, Volume 27, Issue No. 3, 2011 (Impact Factor: 3.31) - Measuring Expertise in Online Communities
C.-M. Au Yeung, M. G. Noll, N. Gibbins, C. Meinel, N. Shadbolt
IEEE Intelligent Systems, Volume 26, Issue No. 1, January/February 2011, pp. 26-32, ISSN 1541-1672 (BibTeX) - Understanding and Leveraging the Social Web for Information Retrieval (Ph.D. Thesis)
M. G. Noll
Ph.D. thesis, submitted in the context of a cotutelle de thèse at both the Hasso Plattner Institute, Germany, and the University of Luxembourg, April 2010 - Telling Experts from Spammers: Expertise Ranking in Folksonomies
M. G. Noll, C.-M. Au Yeung, N. Gibbins, C. Meinel, N. Shadbolt
SIGIR ‘09: Proceedings of 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Boston, USA, July 2009, pp. 612-619, ISBN 978-1-60558-483-6 (ACM Link, BibTeX)
(Acceptance Rate: 16%, 78/494)
Read the Technology Review article on this work » - On Measuring Expertise in Collaborative Tagging Systems
C.-M. Au Yeung, M. G. Noll, N. Gibbins, C. Meinel, N. Shadbolt
WebSci ‘09: Proceedings of 1st Web Science Conference, Athens, Greece, March 2009 (BibTeX)
(Acceptance Rate: 16%) - Writing a Personal Link Recommendation Engine
M. G. Noll
Python Magazine, Volume 3(2), February 2009, ISSN 1913-6714 - The Metadata Triumvirate: Social Annotations, Anchor Texts and Search Queries
M. G. Noll, C. Meinel
WI ‘08: Proceedings of 7th IEEE/WIC/ACM International Conference on Web Intelligence, IEEE CS Press, Sydney, Australia, December 2008, pp. 640-647, ISBN 978-0-7695-3496-1 (IEEE Link, BibTeX)
(Acceptance Rate: 19%) - Building a Scalable Collaborative Web Filter with Free and Open Source Software
M. G. Noll, C. Meinel
SITIS ‘08: Proceedings of 4th IEEE International Conference on Signal-Image Technology & Internet-based Systems, IEEE CS Press, Bali, Indonesia, November 2008, pp. 563-571, ISBN 978-0-7695-3493-0 (IEEE Link, BibTeX)
(Acceptance Rate: 33%) - Exploring Social Annotations for Web Document Classification
M. G. Noll, C. Meinel
SAC ‘08: Proceedings of 23rd International ACM Symposium on Applied Computing, Fortaleza, Ceará, Brazil, March 2008, pp. 2315-2320, ISBN 978-1-59593-753-7 (ACM Link, BibTeX)
(Impact Factor: 0.85) - Web Search Personalization via Social Bookmarking and Tagging
M. G. Noll, C. Meinel
ISWC ‘07: Proceedings of 6th International Semantic Web Conference & 2nd Asian Semantic Web Conference, Springer LNCS 4825, Busan, South Korea, November 2007, pp. 367-380, ISBN 978-3-540-76297-3 (SpringerLink, BibTeX)
(Acceptance Rate: 19%, 50/255) - Authors vs. Readers: A Comparative Study of Document Metadata and Content in the WWW
M. G. Noll, C. Meinel
DocEng ‘07: Proceedings of 7th International ACM Symposium on Document Engineering, Winnipeg, Canada, August 2007, pp. 177-186, ISBN 978-1-59593-776-6 (ACM Link, BibTeX) - Design and Anatomy of a Social Web Filtering Service
M. G. Noll, C. Meinel
CIC ‘06: Proceedings of 4th International Conference on Cooperative Internet Computing, Hong Kong, October 2006, pp. 35-44, ISBN 978-981-281-109-7 (WSP Link, BibTeX) - Web Page Classification: An Exploratory Study of Internet Content Rating Systems
M. G. Noll, C. Meinel
HACK ‘05: Proceedings of HACK conference, Luxembourg, October 2005, ISBN 978-2-9599708-0-1 (BibTeX)
Selected Press Coverage
- New Ranking Algorithm Separates Digital Wheat from Chaff, 2009
Article on Communications of the ACM (CACM) Online - A Better Way to Rank Expertise Online, 2009
Article by Technology Review (US), published by Massachusetts Institute of Technology (MIT); the article features external feedback by the HITS ranking algorithm inventor Prof. Jon Kleinberg and others - Finding Better Friends: Delicious and SPEAR, 2009
Article on ReadWriteWeb, one the Top blogs worldwide on Web technology - How SPEAR Identifies Domain Experts within Delicious (link is broken nowadays), 2009
Invited article for Yahoo!, published on the Delicious.com blog - Speer gegen Spam (German), 2009
Interview with 20 Minutes, the most popular daily newspaper in Switzerland
Research Data Sets
- CABS120k08 (published 2008)
Large research data set about Web metadata based on a sample of 120,000 web documents with data retrieved from the Open Directory Project, the AOL Search query log corpus AOL500k, Google PageRank, Delicious.com/Yahoo!, and anchor text from incoming hyperlinks - DMOZ100k06
(published 2007)
Large research data set about document metadata based on a random sample of 100,000 web documents from the Open Directory combined with data retrieved from Delicious.com/Yahoo!, Google, and ICRA.
Patents
- Detecting co-occurrence patterns in DNS, US9680842B2, 2017
Tutorials
See my separate Tutorials section.