IMPACT 2019: Anomaly detection at scale for performance engineers – Tuli Nivas, Salesforce

March 7, 2019

As performance engineers, we understand the importance of testing software during and after development to identify performance bottlenecks. Constraints such as a scaled-down test environment, limited data volume, or code integration limitations mean that not every bug can be caught in test, which makes anomaly detection in production all the more significant. Customers can be impacted whenever a performance bottleneck is not identified and resolved in a timely manner, and the scale at which this detection has to happen is a challenge in its own right: a few servers in test versus thousands of servers in production. With time of the utmost importance, anomaly detection at scale is one of the biggest challenges for a performance engineer.

One of the most widely used techniques for identifying performance bugs is to inspect time series data for the various metrics that might pinpoint a problem. This approach does not scale well in production, even when the time series data can be consolidated into a few charts. Given how time-consuming this kind of analysis can be, this presentation illustrates how applying simple statistics and basic linear regression principles can improve the productivity of a performance engineer tenfold or more. Automating anomaly detection in production with simple data science techniques, and eliminating the reliance on time series data, can shorten both the time it takes to identify an issue and the time it takes to get customers out of an outage.
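The abstract does not spell out which statistics the talk uses, so the following is only a minimal sketch of the general idea, not the presenter's method. It assumes each server reports one aggregated pair of numbers per window (requests per second and CPU percent, both hypothetical metrics), fits a single least-squares line across the fleet, and flags servers whose residual is far from the fleet's typical residual. The robust z-score based on the median absolute deviation, the 3.5 threshold, the host names, and the sample values are all assumptions chosen for illustration.

```python
from statistics import mean, median


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def find_anomalous_hosts(samples, threshold=3.5):
    """Return hosts whose CPU deviates strongly from the fleet-wide trend.

    samples maps a host name to (requests_per_sec, cpu_percent).
    The threshold applies to a robust z-score of the regression residuals
    (an assumption of this sketch, not something stated in the abstract).
    """
    hosts = list(samples)
    xs = [samples[h][0] for h in hosts]
    ys = [samples[h][1] for h in hosts]
    slope, intercept = fit_line(xs, ys)

    # Residual: how far each host sits above or below the fitted line.
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]

    # Median and median absolute deviation are less distorted by the
    # outliers we are trying to find than mean and standard deviation.
    center = median(residuals)
    mad = median(abs(r - center) for r in residuals)
    if mad == 0:
        return []  # no spread to compare against

    flagged = []
    for host, r in zip(hosts, residuals):
        robust_z = 0.6745 * (r - center) / mad
        if abs(robust_z) > threshold:
            flagged.append(host)
    return flagged


# Hypothetical fleet snapshot: CPU grows roughly linearly with load,
# except web08, which burns far more CPU than its load explains.
fleet = {
    "web01": (100, 15.2),
    "web02": (200, 24.8),
    "web03": (300, 35.1),
    "web04": (400, 45.0),
    "web05": (500, 54.9),
    "web06": (600, 65.3),
    "web07": (700, 74.7),
    "web08": (300, 80.0),
}

print(find_anomalous_hosts(fleet))
```

Because the check works on one summary point per server rather than on a chart per server, the same function can be applied to thousands of hosts without anyone reading time series by eye, which is the scaling property the abstract argues for.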
