SREDAY

Site Reliability, DevOps and Cloud

Sep 15-16, 2022 London, UK

2
Days
30+
Speakers
2
Tracks
100+
Attendees

Event finished

Tickets closed

Schedule

Day 1

Keynote: Supercharging Observability with Feature Flagging

Freelance
Feature flags allow you to enable and disable code without changing or deploying any source code, as well as letting you selectively route traffic to certain users or a percentage of certain users, along with other great tricks. It’s powerful stuff … but when you combine it with observability...

Keynote: The state of SRE in 2022

SRE Author
Come and explore the landscape of SRE as it is in 2023, with the new trends, techniques and tools on the horizon.

10:00

Keynote: Developers are all the same

Postman
We know that developers are not all the same. But wait, they kind of are. This talk shares lessons learned from 20+ million developers to help SREs better engage with application developers.

10:30

Coffee break

Main lobby


Let's play the SRE Game!

Xebia
Whether you are already working with SRE in your organization or thinking about implementing this methodology, you learn the best by doing, and doing is much more fun when playing. The SRE game provides a safe environment to start experimenting with SRE. In the game, you will be challenged to...

Managing Containers at Scale

Slim.ai
Moving your application to containers can be daunting. Many organizations have struggled with the move to microservices and containerized workflows. We’ve been there, and now that we’ve found the light at the end of the tunnel, we want to share it!

Multi Region Deployments with AWS CI/CD and Terraform

AWS
Have you ever looked into multi region deployments on AWS whether to enhance user experience or ensure business continuity to name a few? Are you interested to hear about how to achieve repeatability of multi region deployments with AWS CI/CD and Terraform? If yes, this talk is for you.

12:30

Kubernetes, Lessons Learnt

AWS
This session will focus on lesson learnt from architecting and deploying (large) kubernetes clusters at production environments. The lessons are all real world, Ones that me and the customers have experienced. If I can "Save" one soul I have done my part :-)

13:00

Lunch & networking

Main lobby


Incidents: the customer empathy workshop you never wanted

FireHydrant
Organizations are focusing on incidents more than ever but failing to leverage them to their full potential. But by framing incidents and post-incident reviews as customer empathy-building opportunities, we can facilitate more creative technical problem-solving, unlock improvements to your...

14:30

A State of Continuous Merge: The Secret to Happier, More Productive Dev Teams

LinearB
Developer happiness is a nearly exact reflection of how happy your code merge path is... when it's riddled with friction, this impacts dev experience as well. We'll dive into how to change this.

Overcoming CVE Shock - Adding Perspective in Vulnerability Scanning

Armo
“CVE shock” is the state of total helplessness felt by a dev or security engineer facing the overwhelming list of CVEs returned by the vulnerability scanner. Sound familiar? We’ll unpack this and help you overcome it.

15:30

The Freedom of Kubernetes requires Chaos Engineering to shine in production

Dynatrace
Like any other technology transformation, k8s adoption typically starts with small “pet projects”. One k8s cluster here, another one over there. If you don’t pay attention, you may end up like many organizations these days, something that spreads like wildfire: hundreds or thousands of k8s...

Policy as [versioned] Code

UK Government
Policy often causes more harm than good, is slow to update, exemptions are harder still to manage, measuring compliance at scale is near on impossible. Throwing some curly braces at a problem is not the solution. How do we fix it?

16:30

Day 2

10:30

Coffee break

Main lobby


SRE Workshop

Grafana Labs
Hands-on session on latest SRE tools from Grafana Labs

11:30

Automation with Scalable & Secure SSH

TIDAL
How do you deliver security and reliability on a network with thousands of servers with complete automation? We will use signed certificates with principals for SSH to automate the infrastructure with Ansible. It will scale SSH securely and reliably to automate thousand of servers using Ansible.

12:00

Reliability Runs Efficiently at the Junction of Automation and Agile

Capital One
Reliability Engineering is more relevant daily, but there is a right way and a wrong way to approach it. The wrong way is manual tracking and validation. The right way is automation and single-source of truth. We designed this presentation to show how automation can make Teams' work self-managing.

12:30

Technology is Necessary, But Not Sufficient

OpenCredo
Adopting and evolving technologies in an organisation is important and can offer significant advantages. However, translating these advantages into bottom-line benefits often comes from combining technical change with social change - we must not just change the technology, but also rewrite the...

13:00

Lunch & networking

Main lobby


14:00

Migrating a monolith to Cloud-Native and the stumbling blocks that you don’t know about

IBM
Have you started your Cloud-Native journey? Now that you’ve gotten past that first hurdle, I’m here to say there’s more to think about. I’ll talk about stumbling blocks that a masked enterprise called AsgharLabs had to deal with, and hopefully shed some light on things you haven’t thought about.

Chaos Engineering for Cloud native Apps

Thoughtworks
Improve application resilience with chaos testing by deliberately introducing faults that simulate real-world outages. Azure Chaos Studio is a fully managed chaos engineering experimentation platform for accelerating discovery of hard-to-find problems, from late-stage development through production.

15:00

DevSecOps - The Story of 3 Security Breaches

ShadowMap
Security is generally the compromise made in the interest of faster release cycles. This talk covers the story of three real-world security breaches and the DevSecOp failures behind them. Featuring vastly different environments, these stories highlight the commonalities in most security breaches.

Stateless Linux Kernel Crashdumps

Cloudflare
Ordinary Linux kdump support shares state between host system and crashkernel via root disk. How do you dump if there's no disks at all, or root disk LUKS keys aren't sharable with the crashkernel ? We demo a setup that successfully crashdumps by HTTP PUT using only crashkernel cmdline arguments.

16:00

Impostor syndrome in the IT world from a conference speaker's perspective

Contino
Impostor Syndrome affects most of the people working in IT. It affects also conference speakers. I will share with you my struggles in public speaking and how I fight the impostor syndrome at every conference. Hopefully helping you to fight impostor syndrome in your every day life.

From Zero Visibility to Full Observability with Grafana

Grafana Labs
The journey from zero to hero in the Observability field!

Secure your secrets with automation

Microsoft
Secrets are something we need to implement to secure our apps/api communication, how can we handle secrets securely? This session will focus on secret definition, why it's important to rotate them and how to handle them in a secure way. This session is based on a real business case, and include a...

How to make Dev and Ops folks look good as an SRE

StormForge
There are many ways to make someone look good. As an SRE you can help your Dev and Ops colleagues with making sure their applications run reliably, perform well and don’t consume too much resources. Do this in a large environment, with many moving parts and you’ll be their hero. During this talk...

18:00

Dev Team Metrics that Matter

LinearB
What is the most valuable outcome? This is the most important question to answer before starting an engineering metrics program. In this talk I discuss which dev team metrics matter most, the outcome they produce, and pitfalls to avoid when starting a metrics initiative.

An SRE guide to Linux Kernel upgrades

Cloudflare
Are you afraid of production Linux Kernel upgrades? Too risky? But what if I told you that it is more risky NOT to upgrade your kernel regularly? And what if I told you it is safer to deploy the Linux Kernel than any other software? This talk aims to demystify Linux Kernel releases !

19:00

Speakers

Aengus Rooney
Grafana Labs
Read more →
Aengus Rooney & Willie Engelbrecht
Grafana Labs
Read more →
Ajuna Kyaruzi
Datadog
Read more →
Alayshia Knighten
Freelance
Read more →
Andrea Francesco Giunta
Microsoft
Read more →
Antonio Cobo
Contino
Read more →
Ariel Illouz
LinearB
Read more →
Ashish Bhalgat
Thoughtworks
Read more →
Ben Hirschberg
Armo
Read more →
Chris Nesbitt-Smith
UK Government
Read more →
Frank Hofmann
Cloudflare
Read more →
Gunnar Grosch
AWS
Read more →
Henrik Rexed
Dynatrace
Read more →
Ignat Korchagin
Cloudflare
Read more →
Jelmer de Jong & Lennart Timmers
Xebia
Read more →
JJ Asghar
IBM
Read more →
Joe Scholz
Capital One
Read more →
Joyce Lin
Postman
Read more →
Kobi Biton
AWS
Read more →
Lerna Ekmekcioglu
AWS
Read more →
Martin Wimpress
Slim.ai
Read more →
Miko Pawlikowski
SRE Author
Read more →
Niels Roetert
StormForge
Read more →
Rajat Gupta
TIDAL
Read more →
Ryan McDonald
FireHydrant
Read more →
Simon Copsey
OpenCredo
Read more →
Yash Kadakia
ShadowMap
Read more →
Yishai Beeri
LinearB
Read more →

Venue

Design District, North Greenwich, London

The Bureau,
The Gateway Pavilions,
Peninsula Square,
London SE10 0QE, UK

Tube access
Jubilee line: North Greenwich station

Sponsors & Partners

Want to become a sponsor? Get in touch!