1. Library
  2. SF Metrics: Twitter’s Megan Kanne, and Bugsnag’s Emily Nakashima

SF Metrics: Twitter’s Megan Kanne, and Bugsnag’s Emily Nakashima

Light Mode
What Your JavaScript Does When You’re Not Around
Emily Nakashima, Lead Front End Engineer at Bugsnag
Reliable Alerting at Scale
Megan Kanne, Senior Software Engineer at Twitter (Observability Team)
3 min

Earlier this year, Heavybit member company Librato hosted their regular SF Metrics Meetup at our San Francisco Clubhouse. The event featured talks from Twitter’s Megan Kanne, and Bugsnag’s Emily Nakashima. Videos of these talks are included below, sign up here if you’d like to attend in person next time.

What Your JavaScript Does When You’re Not Around

Emily Nakashima, Lead Front End Engineer at Bugsnag

From Emily: when we shipped a big re-write of Bugsnag’s customer-facing dashboards as 40,000 line Javascript app, first we celebrated … and then we looked at our client-side monitoring and did a lot of head-scratching. We’d tested in multiple browsers and thought through dozens of use cases, but when we looked at the kinds of real errors coming back to us from the wild, the reality was so much weirder than we’d expected.

As client-side app frameworks like React and Ember keep growing more popular, we’re shipping more and more application logic out to users’ browsers. But we don’t always know much about what happens to it after we send it out to the client.

In this talk I take you on a fast-paced tour of all the strange cases we’ve looked at since shipping our new dashboard, from overseas proxy sites to rogue browser extensions to out-and-out clones of our UI. Finally, I’ll talk about how to cut the noise and focus on monitoring and mitigating the cases that really matter to your users’ experience.

Reliable Alerting at Scale

Megan Kanne, Senior Software Engineer at Twitter (Observability Team)

Twitter’s Observability team provides Twitter developers a monitoring infrastructure including real time dashboards and alerting for their services. Twitter has grown orders of magnitude since it debuted at SXSW. The monitoring infrastructure has followed suit, ingesting orders of magnitude more metrics, from millions to billions.

As the company grew, so did the challenges of providing an always available alerting system. Design decisions made when Twitter had only one datacenter and a monolithic architecture created unresolvable scale and reliability issues for its alerting service.

In this talk Megan walks us through these challenges and describes Twitter’s solution: a next-gen alerting system that provides reliable, realtime, multi-zone alerting at scale.