4 Ways to Prepare Your Application for Remote Workers

Companies all across the world are adjusting to new work-from-home policies and are taking precautions to limit the impact diseases are having on the lives of employees and customers. The virus has created a ripple effect, impacting everything from a visit to the local grocery store to countless conference cancellations. And the world became aware of this crisis only a little over a month ago. Tech companies have responded by asking, and even requiring, employees to work remotely. The CEO of Zoom Video Communications stated publicly that usage is at an all-time high, most likely due to restrictions on travel.

For companies that deliver applications that enable remote work and collaboration, this has obvious implications. To enable a sudden and potentially sustained burst of utilization, there needs to be a business continuity plan. Executives at these companies must be asking:

  1. How do we keep our employees safe and productive?
  2. How do we continue to meet SLAs as usage increases?
  3. What is our capacity planning strategy?
  4. For incidents that do occur, are we adequately prepared to address them?
  5. As the utilization of services increases, what is the impact on margins?

Indeed, these questions should be top of mind for companies in the remote-work space, but even companies that now have more employees working remotely on in-house applications face similar challenges.

These are questions we’re thinking about here at Splunk, where we treat data as the fuel that helps us make better decisions.

From a technical operations perspective, we’ve identified 4 areas where companies can find these answers:

  1. Measure what matters
  2. Drive standardization of tools
  3. Employ an effective escalation policy
  4. Make learning a part of the process

Measure What Matters

Access to accurate, discoverable, and timely data is what drives collaborative planning and response. Even in the era of the cloud, resources are not limitless. It is critical to develop a deep understanding of infrastructure utilization and of how application changes over time have affected performance and reliability, particularly when capacity planning. However, baseline analysis alone does not adequately safeguard against future incidents. An effective metrics system will be capable of firing an alert within seconds, keeping mean time to detect (MTTD) and mean time to acknowledge (MTTA) low. Distributed tracing has become the go-to debugging approach for more complex application architectures, where multiple services are called to fulfill individual requests. Beyond identifying causality during incidents, tracing can also help technical teams understand the overall impact on application performance: aggregating the metadata contained within the traces produces tag-specific SLIs.
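To make the idea of tag-specific SLIs concrete, here is a minimal sketch of aggregating span metadata by tag. The `Span` shape, the `region` tag, and the 300 ms latency threshold are all assumptions chosen for illustration; they are not taken from any particular tracing product.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Span:
    """Hypothetical, simplified span record (not a real tracing schema)."""
    trace_id: str
    duration_ms: float
    error: bool
    tags: dict = field(default_factory=dict)


def sli_by_tag(spans, tag_key, latency_slo_ms):
    """Group spans by the value of one tag and compute per-value SLIs.

    For each tag value this returns the request count, the error rate,
    and the share of requests that succeeded within the latency threshold.
    """
    groups = defaultdict(list)
    for span in spans:
        groups[span.tags.get(tag_key, "unknown")].append(span)

    result = {}
    for value, members in groups.items():
        total = len(members)
        errors = sum(1 for s in members if s.error)
        good = sum(
            1 for s in members
            if not s.error and s.duration_ms <= latency_slo_ms
        )
        result[value] = {
            "requests": total,
            "error_rate": errors / total,
            "good_ratio": good / total,
        }
    return result


# Illustrative data only.
spans = [
    Span("t1", 120.0, False, {"region": "us-east"}),
    Span("t2", 480.0, False, {"region": "us-east"}),
    Span("t3", 95.0, True, {"region": "eu-west"}),
    Span("t4", 110.0, False, {"region": "eu-west"}),
]

slis = sli_by_tag(spans, "region", latency_slo_ms=300.0)
for region, values in sorted(slis.items()):
    print(region, values)
```

Grouping by a different tag (customer tier, service version, and so on) only requires changing `tag_key`, which is what makes trace metadata useful for slicing performance data after the fact.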

Drive Standardization of Tools

Unfamiliarity with tools and data sets used across teams creates a huge obstacle to responsiveness and cross-team collaboration. It is not uncommon for two teams to produce different metrics from the same datasets. The more tools in use, the more likely one is to encounter data that is, or appears to be, inconsistent. Time will be spent debating dashboard and data validity rather than focusing on capacity planning and updating runbooks. When something does go wrong, the last thing the incident manager wants to run into is conflicting tools and dashboards. As open-source data collection grows in popularity, and IT operations companies grow the breadth of their offerings, there are more options than ever to collapse the observability stack.
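As a hypothetical illustration of how two teams can produce different metrics from the same dataset, consider an "error rate" computed under two reasonable but different definitions. The request log and both function names below are invented for this sketch.

```python
# Illustrative request log shared by both teams (made-up data).
request_log = [
    {"status": 200}, {"status": 200}, {"status": 200},
    {"status": 404}, {"status": 404}, {"status": 500},
    {"status": 200}, {"status": 200}, {"status": 200},
    {"status": 503},
]


def error_rate_server_only(rows):
    """Team A's definition: only 5xx responses count as errors."""
    return sum(1 for r in rows if r["status"] >= 500) / len(rows)


def error_rate_all_failures(rows):
    """Team B's definition: any 4xx or 5xx response counts as an error."""
    return sum(1 for r in rows if r["status"] >= 400) / len(rows)


print("Team A error rate:", error_rate_server_only(request_log))
print("Team B error rate:", error_rate_all_failures(request_log))
```

Neither number is wrong, but the two dashboards disagree by a factor of two. Agreeing on a single shared definition, and a single place where it is computed, removes that debate before an incident starts.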


Remote Work: Splunk + Zoom

As everyone is taking proactive measures to stay healthy, organizations are increasingly having their employees work from home. At Splunk, we are focused on bringing data to every question, decision and action, and for us remote work means Zoom for online meetings and workspaces.

As our customers use Splunk for real-time data processing and analytics, they use our Splunk Mobile App (Android, iOS) when they need to take their dashboards on the go. Our engineering teams use this same capability when collaborating remotely.

Actionable Alerts – Zoom War Room

With Splunk Mobile, you can receive alerts wherever you are and take action. These actions can be anything — and we mean anything — as they are simply URL webhooks that you define to attach to your alerts.

Here’s an example of how we use alerts with Zoom.

The Splunk Mobile Engineering team uses Zoom to create “War Rooms” when we need to get everyone in the same place to troubleshoot critical issues quickly. Zoom makes it easy for us to create these meeting rooms automatically through its API. Whenever there is a Splunk-generated alert, we attach a Zoom link and send it to all relevant stakeholders. Once they receive it, all stakeholders can join the room immediately at the tap of a button. What’s more, they’ll all be looking at the same dashboard that triggered the alert in the first place.
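The flow described above might be sketched roughly as follows. This is not Splunk's actual implementation, which the post does not show. It assumes Zoom's public REST endpoint for creating a meeting (`POST /users/{userId}/meetings`, with `type` 1 meaning an instant meeting and `join_url` returned in the response) and an OAuth access token obtained elsewhere; verify those details against Zoom's current API documentation. The function names and message format are hypothetical. The sketch only builds the request and the alert text, so it runs without network access.

```python
import json
import urllib.request

# Assumed base URL of Zoom's REST API; confirm against Zoom's documentation.
ZOOM_API_BASE = "https://api.zoom.us/v2"


def build_war_room_request(alert_name, access_token, user_id="me"):
    """Build (but do not send) a request that would create an instant meeting."""
    payload = {
        "topic": f"War Room: {alert_name}",
        "type": 1,  # assumed: 1 = instant meeting in Zoom's API
    }
    return urllib.request.Request(
        url=f"{ZOOM_API_BASE}/users/{user_id}/meetings",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def alert_message(alert_name, dashboard_url, meeting_response):
    """Compose the alert text from the parsed meeting-creation response."""
    join_url = meeting_response["join_url"]
    return (
        f"[ALERT] {alert_name}\n"
        f"Dashboard: {dashboard_url}\n"
        f"Join war room: {join_url}"
    )


# Placeholder values for illustration only.
request = build_war_room_request("Checkout latency high", "EXAMPLE_TOKEN")
fake_response = {"join_url": "https://example.zoom.us/j/0000000000"}
print(request.get_method(), request.full_url)
print(alert_message(
    "Checkout latency high",
    "https://splunk.example.com/dashboards/checkout",
    fake_response,
))
```

In a real integration, the request would be sent with `urllib.request.urlopen(request)` and the JSON response parsed before composing the message, with error handling and token refresh added around it.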

Sharing Splunk Dashboards with Zoom

Splunk Mobile has sharing features built into the app. Examples include sharing your data as captured images or as a live link to an interactive dashboard. Our favorite thing to do is to copy that link into a Zoom chat or Slack thread, where people can kick off their own investigations.
