BIG DATA CONFERENCE
Carlos Manuel Duclos-Vergara
In this workshop we will look deeper into the architecture of Kafka in order to understand how we can get the best performance. During the course of this workshop we will build some applications in order to highlight the different performance outcomes based on the way we design the application.
The Workshop will cover basic concepts of Hadoop and mostly in The Cloudera stack, like using HBase & Impala to query data, using Spark to stream data, afterwards we will launch a Cloudera quickstart, using datasets of top-rated movies in the workshops, getting the data analyzed and queried with Hadoop, explaining & demonstrating Map Reduce Concepts, RDD Partition on Spark.
The workshop will cover the basics of a Machine Learning project, from start to production release. We will concentrate on the optimization part of the chosen model. You will learn to use the ONNX Runtime to serve the model, check the performance compared to the initial model, and use a programming language different from the starting one.
This workshop will cover the things that developers and DBAs can do to improve security in MySQL by mainly covering security-related issues pertaining to MySQL, but also putting some performance aspects into the mix – people will learn how to secure their MySQL instances and keep them performing at the best of their ability at the same time.
Paweł Skrzypek & Alicja Reniewicz
During our workshop, entitled: “Proactive and polymorphic adaptation of multi-cloud deployments”, we will present how to implement Multi-Cloud native strategies using an advanced open-source framework. Such framework allows for Cloud-agnostic Cross-Cloud deployment and optimized management of a Cloud application based on flexible monitoring, context-aware maximization of the application owner’s utility of the deployed application, and autonomic reconfiguration based on the application’s current execution context. We provide a practical introduction to Multi-Cloud application modelling, configuration, deployment, and adaptation during this workshop. All stages of the Cloud deployment planning and designing process will be shown. Also, all the key steps in the deployment and autonomic application management will be demonstrated.
Flux is a lightweight data scripting language for fast-prototyping streaming and time-series databases. It is maintained by InfluxData, i.e., the company behind the most popular time-series database. This half-day course provides an introduction to the InfluxDB 2.0 and It covers fundamentals about time series analysis and stream processing. Central to the course is the use of Fluxlang by InfluxData. The course will introduce you to Flux core concepts and it will make use of Influx Cloud free tier.