
Schedule

The schedule is under review and may change as we finish reviewing the talks.
Submit your talk and take part in this wonderful event! Contact dataops at binlogic.net
  • Day 1 | Room A1

    June 21, 2018 | Room A1

  • 09:40 - 10:20
    Moving from a monolithic to a microservice-oriented database architecture.
    Databases

  • 10:25 - 11:00
    The purpose of this talk is to review the evolution of Big Data. It starts with an informal retrospective analysis of the factors that have driven us to adopt the Big Data concept: from technology limitations to social behavior evolution. Next, it reviews current misuses of the term Big Data and proposes a meaningful definition of this concept beyond the generic, yet empty, V-based definition. Finally, it discusses current strategies and architectural pipelines to address the storage and computation challenges of large volumes of data.
    Big Data

  • 11:05 - 11:45
    Cities are much more than a specific space. To understand cities we must understand their flows, and to understand their flows, we must understand them as a network of networks: the relationships between the objects that make up the city system. Thanks to the amount of data available in cities around the world, we can test models of scientific disciplines such as complexity sciences, social physics, urban economics or transport theory. This conference will present theories and methods that reveal how cities work through the lens of Big Data.

  • 11:50 - 12:30
    For a project, we have been setting up a new MySQL database layer in a cloud environment. This solution is built on virtual machines and consists solely of open source tools. We implemented a proxy layer based on ProxySQL to separate the MySQL topology from the application layer. Not all applications are built with read-write splitting capabilities, and some are too hard to reconfigure automatically, so the proxy layer helps us route traffic to the correct database servers. For topology management, we use Orchestrator, currently developed by the Database Infrastructure Team at GitHub. Other tools include Percona XtraBackup as the primary backup solution and Percona Toolkit to help perform maintenance in the environment. The entire infrastructure is managed using Puppet. We have developed some open source Puppet modules to implement all of this.
    Infrastructure
    MySQL

  • 12:35 - 13:15
    Big Data
    Databases
    Engineering
    Infrastructure

  • 14:00 - 14:40
    This talk will explain how HAProxy can be used to monitor live traffic and detect abusers. It will also provide some tips for blocking or slowing down attacks.
    Security

  • 14:45 - 15:25
    The session will focus on the challenges faced in building an Enterprise Data Lake in a multinational company like Zurich. Creating a Big Data infrastructure inside a company used to the classical enterprise infrastructure of Business Intelligence (BI) brings not only technological challenges but also managerial ones, such as recruiting new profiles for the team, the coexistence of BI and Big Data technologies, the introduction of open source tools and their industrialization, and the on-premises versus cloud dilemma, among others.
    Big Data
    Real life Cases

  • 15:25 - 16:00
    Automation
    Databases
    MySQL

  • 16:00 - 16:40
    ClickHouse is an open source analytical DBMS. It is capable of storing petabytes of data and processing billions of rows per second per server, all while ingesting new data in real time. I will talk about the motivation behind ClickHouse development and show some notable use cases. I will also discuss the architectural decisions we made with ClickHouse, their consequences from an application developer's point of view, and how to determine whether ClickHouse is a good fit for your use case.
    Big Data
    Databases

  • 16:40 - 17:20
    At the end of 2016, Oracle released a new plugin called MySQL Group Replication, a new MySQL replication method that aims to provide better high availability and built-in failover with consistency guarantees. I evaluated the initial GA versions back in early 2017. I presented my initial findings, along with several best practices and concerns about the current implementation, which led me to state that Group Replication was not quite ready yet. (https://www.slideshare.net/Grypyrg/my-sql-group-replication)

  • 17:20 - 18:00
    Terraform by HashiCorp is a tool of choice for many companies shifting their workloads to cloud-native environments. The most common issue with this tool is that decisions made at the beginning of your journey can be hard to change afterward. I will share a set of principles and best practices that will save your day after going to production.

  • Day 1 | Room A4

    June 21, 2018 | Room A4

  • 09:40 - 10:20
    The General Data Protection Regulation (GDPR), one of the most important changes in data privacy regulation in 20 years, will take effect May 25, 2018. The coming regulation includes a number of new policies requiring organizations to protect the data privacy of all individuals within the European Union (EU). MariaDB TX, a complete database solution for any and every enterprise, includes the security features your organization needs to ensure personal data is protected. MariaDB TX supports encryption of data at rest, data in motion, and data in use. MariaDB TX supports auditing of database events, enabling security and compliance monitoring as well as attack forensics. The database proxy supports the pseudonymization of personal data with the data masking filter, configurable on a per-column, per-user basis.
    Databases
    MariaDB

  • 10:25 - 11:00
    Leverage HAProxy's many features to turn it into a wonderful API gateway solution.
    Infrastructure

  • 11:05 - 11:45
    Engineering
    MySQL
    Where
    Hall 1, Building A

  • 11:50 - 12:30
    FogLAMP is an open source data infrastructure for the Internet of Things, that works in a distributed fashion on nano and microservices, from tiny sensors to large cloud servers. It is a young project on GitHub, but it has already attracted partnerships and contributions from companies like ARM, Arrow, BTE, OSIsoft, Panduit, Sprint, Toshiba... and more to come soon. In this presentation we will talk about the implementation and use of FogLAMP in IoT projects and how FogLAMP can handle an end-to-end interaction between Edge devices, sensors and “things” with high-end Cloud systems. We will evaluate use cases and we will also see FogLAMP in action.
    Databases
    Infrastructure
    Internet of Things

  • 12:35 - 13:15
    Starting with MySQL 5.7 a new Document Store feature has been introduced that makes working with JSON documents an integral part of the MySQL experience. The new X DevAPI gives MySQL users the best of both worlds - SQL and NoSQL - and allows an entirely new category of use cases for managing data. It is constantly evolving based on the community feedback and can be run on top of the brand new

  • 14:00 - 14:40
    Security is an issue; controlling what can access your data is a must; how to do it is a nightmare. But if you follow us on this journey, you will discover that implementing quite robust protection is possible. Even more is possible, and your performance will improve. Cool, right? We will discuss:
    - Implement selective query access
    - Define accessibility by location/ip/id
    - Reduce the cost of filtering to a minimum
    - Automate the query discovery

  • 14:45 - 15:25
    This session describes the automated MySQL failover setup at GitHub, based on orchestrator, Consul and GLB/HAProxy, which delivers hands-free high availability of our MySQL clusters. Our setup is tolerant to master, intermediate master and complete DC failures, and recovery makes the best of a situation based on state rather than configuration. We will illustrate our multi-DC orchestrator/raft, Consul and load balancer setups, as well as our production testing setup. We will discuss the choices we've made and the path we took while reducing MySQL outages without introducing false positives.
    Databases
    Engineering
    MySQL

  • 15:30 - 16:10
    If you’ve ever had to evaluate server performance, you likely know of sysbench (and if you don’t, come see this session!). In recent months, Alexey released a major refactor that was mainly intended to improve how sysbench behaves with new hardware, but it also included a refactor of the Lua interpreter. In this talk we will show how you can customize your benchmarks and take advantage of some of the new features in version 1.0, going through some live demos during the presentation.
    Databases
    Engineering
    Infrastructure

  • 16:15 - 16:55
    Securing your data is only as good as your weakest link. A clear-text password in a file or history file, shared privileges between test and production, or open sudo access when you can connect as an unprivileged user are all security flaws. This talk discusses how to navigate the poor defaults MySQL has in place, how to strengthen processes and how to audit your environment. It also covers the complexity of deploying changes in an always-available production environment.
    Databases
    Infrastructure
    Security

  • 17:00 - 17:40
    Letgo is a second-hand marketplace where many millions of users meet, and one of the most used apps in its sector in the USA. Letgo is also a data-driven company, and the steady growth of its user base has led us to build a dynamic architecture able to scale horizontally to process terabytes of data, both in batch and in streaming.

  • Day 2 | Room A1

    June 22, 2018 | Room A1

  • 10:20 - 11:00

  • 11:05 - 11:45
    Redundancy and high availability are the basis for all production deployments. Database systems with large data sets or high-throughput applications can challenge the capacity of a single server, such as CPU for high query rates or RAM for large working sets. Adding more CPU and RAM for vertical scaling is limited. Systems need horizontal scaling by distributing data across multiple servers. MongoDB supports horizontal scaling through sharding. Each shard consists of a replica set that provides redundancy and high availability. In this session we will talk about:
    - How MongoDB HA works
    - Replica set components/deployment topologies
    - Cluster components: mongos, config servers and shards/replica set
    - Shard keys and chunks
    - Hashed vs. range-based sharding
    - Reads vs. writes on a sharded cluster
    Databases
    MongoDB

  • 11:50 - 12:30
    Dark Data are data that exist in companies but are not currently leveraged in decision making, due to cost of acquisition, technical complexity, and risk to critical systems. It is estimated that more than 65% of total data in a company is hidden in people, machines and networks. In this talk, Datumize engineers will explain some of the dark data we have found by doing network sniffing and machine unlocking, using real use cases for customers around the world, like a global Tour Operator that is understanding the "lost searches" in a website, or a toxicology company in the United States feeding analytical insights into a cloud-based platform with AI connected. All these dark data, once refined, need to be stored and exploited using a combination of resilient databases and mission-critical infrastructure that scales with load, so extreme operations are needed.

  • 12:35 - 13:15
    At Source{d} we have to analyze a huge number of git repositories and extract insights from the source code. To do this, we have created an SQL database engine that understands git repositories and is MySQL protocol compatible. Along the way we had to tackle problems such as reconciling the differences between the git repository model and relational databases, creating a database engine in Go, and finding ways to store and use indexes. You'll learn about the journey from a side project to the current state, its internals and the different solutions we tried.
    Git

  • 14:00 - 14:40
    All of us have faced issues when our application is running slower than it should. In this session, we are going to show you the best practices for troubleshooting performance issues when MySQL comes into the discussion. We plan to show you how we do it in our day-to-day work at Percona.
    MySQL
    Real life Cases

  • 14:45 - 15:25
    This talk is structured in 2 parts. The first is an introduction to immutable infrastructure: what it is, its advantages and disadvantages, and the architecture requirements for applications. In the second part I'll explain how we apply those principles in the storage team at Schibsted for our stateful services to provide a managed database offering for multiple customers. The use cases will be based on Cassandra, Kafka and ZooKeeper, which we currently support in many clusters in production.
    Databases
    Infrastructure

  • 15:30 - 16:10
    MongoDB: deploy a 3-member replica set with internal encryption
    MongoDB

  • 16:20 - 17:00
    Centralizing all the company's authentication mechanisms to provide SSO solutions is always challenging. We're going to talk about use cases, how to solve the most common problems with SSO authentication mechanisms, and what to do with applications that still don't support SAML 2.0.
    Automation
    Infrastructure
    Real life Cases

  • Day 2 | Room A4

    June 22, 2018 | Room A4

  • 09:40 - 10:20
    We have been using Cassandra since version 0.6, and while we normally like to talk about our feats, our experience also comes from our mistakes. In this talk, we will go over some of our (big) mistakes and the lessons we learned from them.
    Cassandra
    Real life Cases

  • 10:25 - 11:00
    Spider is a storage engine for MariaDB/MySQL that provides database sharding. Database sharding is a technique for dividing data across multiple servers. In this session, I will introduce Spider and some use cases. I will also talk about its strong points. For example, many sharding solutions don't support cross-shard joins, but Spider does. A cross-shard join is a join operation that spans different shards. On the other hand, if you already have services running and need to shard their data, you will need to switch to sharding without downtime or with the shortest possible downtime. I will talk about this too.
    Spider

  • 11:05 - 11:45
    MySQL 8.0 InnoDB Cluster is a very easy High Availability solution for MySQL where all the complexity is hidden from users and ops. Join this talk to see how easy it is to deploy a 3-node cluster live and to better understand the complexity of Group Replication.
    Databases
    MySQL
    Real life Cases
    Where
    Hall 1, Building A

  • 11:50 - 12:30
    A Marriage Made in the Cloud, Pushed Down to the Edge. Never heard of Fog Computing? If you are interested or involved in IoT, you should have! Fog Computing is the ability to bring the architectures that power the Cloud down to the whole infrastructure of IoT. Smart buildings, cities, factories and fleets require a highly scalable, distributed infrastructure from the ground up. As with the Cloud, the most important aspect is that this infrastructure must be powered by Open Source software. In this talk, we will present the basics of IoT and Fog Computing infrastructures, and their tight relationship with Open Source software projects. We will inspect and analyze use cases of open source systems in IoT, along with the main challenges and priorities.
    Infrastructure
    Internet of Things

  • 12:35 - 13:15
    I will talk about how to deploy MySQL on Kubernetes / OpenShift. My use case will be deploying Percona XtraDB Cluster (PXC) or MySQL Replication together with MySQL Proxy and implementing read/write split, which will allow legacy applications to improve scalability without changing a single line of code. (If you have never used Kubernetes / OpenShift or PXC / MySQL Proxy, I will give a quick introduction to both.) In addition, we can scale the replication slave servers automatically if the load increases.

  • 14:00 - 14:40
    Istio is quite a hot topic today, and there are reasons for it. It helps developers build microservices much faster than before and deploy them reliably to production environments with additional functionality out of the box. Backed by giants like Google, IBM and Lyft, Istio is getting attention, is being very actively developed, and is completely open source! In this talk we are going to take an overview of Istio, what it can do for you and how to get started with it.
    Databases
    MySQL
    Where
    Hall 1, Building A

  • 14:45 - 15:25
    Databases
    MySQL

  • 15:30 - 16:10
    Goal: Present the benefits of ICT transformation in the enterprise market by taking advantage of Cloud infrastructure, Automation and a DevOps mindset.
    Topics:
    > Evolution from traditional datacenter infrastructures to Cloud infrastructure;
    > Benefits of Cloud systems, OpenStack and AWS studies;
    > How to gain business agility using Cloud and cloud-native tools (e.g. Kubernetes);
    > Trending technologies, products, tools and their architectures;
    > Q&A
    Cloud
    Infrastructure

  • 16:15 - 16:55
    Given Cassandra's ability to scale, automation is an important tool to know. Also, to improve resilience, some teams go to multi-cloud deployments. In this talk, we will go over how to automate a full Cassandra deployment over several Clouds. We will cover not only the automation tools used (Terraform and Ansible) but also the pitfalls to avoid in this process.
    Ansible
    Automation
    Cassandra
    Terraform

  • 17:00 - 17:40
    The public cloud is making a huge impact on the way enterprises host, manage, and scale their database operations. They’ve been freed from the scalability constraints of their on-premises systems. They can provision new infrastructure at the click of a button, without a lengthy hardware procurement process. They no longer need large, upfront capital investments to launch new IT projects. And they’re taking advantage of new, cloud-oriented technologies when storing and analyzing their data. Through a range of Database as a Service (DBaaS) options, public cloud vendors now make it easier than ever for organizations to migrate and maintain their databases. But while these solutions address many of the headaches involved in database management, such as migration, provisioning, and administration, there are significant differences between the various DBaaS offerings on the market. In this session, we explore the core DBaaS solutions provided by the leading cloud platforms (AWS, GCP and Microsoft Azure) and compare key features such as the types of database on offer, infrastructure design challenges, DR and HA capabilities, performance indicators, pricing and database security.
    Databases
    Infrastructure