Monday, January 06, 2025
OpenTelemetry Weaver
Via the official OTel GitHub organization: check out OpenTelemetry Weaver,
a tool (written in Rust) that enables developers to easily develop, validate,
document, and deploy semantic conventions … and more to come!
Open-source o11y stack
My tryst with Observability by Jishnu Srivastava is a hands-on experience story
around OTel and open-source observability. Thanks for sharing!
Prometheus 3.0
Wondering where we are with the newest version? Read Prometheus 3.0 Brings New UI,
OpenTelemetry Support and More by Matt Saunders (via InfoQ) and learn all
about it!
OTel for Android issues
Via the official CNCF blog: Francisco Prieto Cardelle of Embrace wrote
Solving Android app issues with OpenTelemetry: Beyond local profiling.
Observability in 2025
Yup, it's that time of the year, indeed. The New Stack's B. Cameron Gain on
Observability in 2025: OpenTelemetry and AI to Fill In Gaps. Who would have
thought to find AI in there, shocker :)
That was it for this edition of the o11y newsletter.
Feel free to share news items with me via Bluesky DM.
Stay safe and hope to see you around next week!
Monday, January 06, 2025
With large language model (LLM) products such as ChatGPT and Gemini taking over the world, we need to adjust our skills to follow the trend.
Monday, January 06, 2025
A message from our sponsor, incident.io: Ever wonder how Netflix handles incidents at their scale? With incident.io, they’ve built a process that’s smooth, scalable, and keeps everyone on the same page. Tools like Catalog and Workflows make it intuitive for teams to tackle incidents consistently, no matter how big the challenge. https://incident.io/customers/netflix Your […]
Thursday, January 02, 2025
Metrics are a cornerstone of evaluating any AI system, and large language models (LLMs) are no exception.
Wednesday, January 01, 2025
Machine learning is now the cornerstone of recent technological progress, which is especially true for the current generative AI stampede.
Monday, December 30, 2024
Netflix and title launches
On the Netflix Technology Blog, a new article was published: read
Title Launch Observability at Netflix Scale where Varun Khaitan writes
about metrics that matter to a (movie) title’s success. The post explains why
going beyond tracking system metrics such as error rates, latencies,
and CPU utilization matters. Excellent food for thought!
Recapping OTel panel
In case you missed the live panel broadcast, read up on Colin Contreary's Holiday Sweaters & OTel Insights: Recapping
our “2025 Unwrapped” panel. Of course, the article also contains a link to
the video recording of this awesome panel, if you prefer to watch it ;)
2024 retro
Dotan Horovits and Charity Majors in another great OpenObservability Talks episode:
read the End-of-Year Observability Retrospective to learn more. Dotan,
KUTGW!
Prometheus vulnerabilities
300,000+ Prometheus Servers and Exporters Exposed to DoS Attacks by Yakir
Kadkoda and Assaf Morag of Aqua Security highlights vulnerabilities and security
flaws within the Prometheus ecosystem. I recommend reading it and applying
the lessons.
Monday, December 30, 2024
One of the most talked-about niches in tech is machine learning (ML), as developments in this area are expected to have a significant impact on IT as well as other industries.
Monday, December 30, 2024
A message from our sponsor, FireHydrant: This New Year, resolve to make incident management smarter, faster, and way less stressful with FireHydrant. Modern on-call, automated incident response, and AI tools that do the heavy lifting. https://firehydrant.com/
Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management. In this post, we’ll […]
Tuesday, December 24, 2024
Artificial intelligence (AI) research, particularly in the machine learning (ML) domain, continues to attract growing attention worldwide.
Monday, December 23, 2024
Monitoring Apache Kafka
M. Cagri AKTAS wrote Monitoring Kafka Clusters: Setup Guide for JMX Exporter,
Prometheus, and Grafana, a step-by-step tutorial to set up an open-source
observability stack for Apache Kafka. Nice work, thank you!
OTel sum connector
In Introduction to the OpenTelemetry Sum Connector, Jeremy Hicks explains
a new OTel Collector component that allows you to create sums from attributes
attached to logs and spans. Jeremy not only shows how it works in action but
also discusses use cases.
Observability 3.0?
The Future of Observability: Observability 3.0 by Hazel Weakly is an
opinion piece that I'd invite you to read and reflect on. Hazel makes the point
that the focus should shift to deriving business value beyond engineering by
enabling effective actions based on the insights. Also, data lakes: a topic
I keep running into more often lately.
OpenSearchCon 2025
The OpenSearchCon Europe CfP is now open and closes on Jan 19, 2025, with
one of the focus areas being observability. OpenSearchCon Europe will take place
in Amsterdam on Apr 30 and May 1. Submit early, submit often ;)