Session: Data plumbing basics: Build, deploy, and scale ML models for your time series data

Tun Shwe, VP of Data @ Quix

Jay Clifford, Developer Advocate @ InfluxData

“Collect”, “Store” and “Act” are the three key pillars to building any time series-based solution. While acting upon your data holds paramount importance, it simultaneously presents a puzzle of questions:

  • How do I query, transform, and process my stored time series data?
  • How can I build and run anomaly detection or forecasting algorithms on my time series data?
  • How can I efficiently scale and expand my time series analytics engine?

In this talk, we will explore the methodology for crafting a scalable time series data pipeline, leveraging the event streaming platform, Quix, and the time series database, InfluxDB. We will also walk through the process of creating, training, and deploying a machine-learning model, utilizing the power and flexibility of Keras and Hugging Face for anomaly detection.

Key Takeaways:

  • Collecting and storing time series data: Grasp the nuances of managing time series data with InfluxDB
  • Model training and storage: Dive into model creation and training using Keras and Hugging Face
  • Pipeline construction and deployment: Explore building and deploying a resilient time series data pipeline with Quix

Join this session to learn how to build a foundational but scalable architecture that you can plumb into your own time series solutions.

Where & when?

Open Source Data Summit 2024 will be held on October 2nd, 2024.

What is the cost of access to the live virtual sessions?

OSDS is always free and open for all to attend.

What is Open Source Data Summit?

OSDS is a peer-to-peer gathering of data industry professionals, experts, and enthusiasts to explore the dynamic landscape of open source data tools and storage.

The central theme of OSDS revolves around the advantages of open source data products and their pivotal role in modern data ecosystems.

OSDS is the annual peer hub for knowledge exchange that fosters a deeper understanding of open source options and their role in shaping the data-driven future.

Who attends OSDS?

OSDS is attended by data engineers, data architects, developers, DevOps practitioners and managers, and data leadership.

Anyone who is looking for enriched perspectives on open source data tools and practical insights to navigate the evolving data landscape should attend this event.

Join again on October 2nd, 2024 for discussions around:
  • Benefits of open source data tools
  • Cost/performance trade-offs
  • Building data storage solutions
  • Challenges surrounding open source data tool integration
  • Solutions for the cost of storing, accessing, and managing data
  • Data streams and ingestion
  • Hub-and-spoke data integration models
  • Choosing the right engine for your workload
Interested in speaking or sponsoring Open Source Data Summit 2024?

Submit a talk proposal here or reach out to astronaut@solutionmonday.com.

Don't miss out on important updates! Register for access to Open Source Data Summit 2024

Register for OSDS 2024 Access

"*" indicates required fields