Release Notes

 

2.2.3 (latest)

kfuse-2.2.3 introduces term search for logs and external dependency tracking for APM services.

New Features and Improvements

Logs

Term search is now the default search type for logs. Users can quote the search string to use the older string contains search. Term search is faster and more efficient in general.

Logs analytics screen is simplified and now much easier to use. You have option to add the queries to a dashboard.

You can now group fingerprints by multiple attributes. Earlier only grouping available was by source.

APM

The external dependencies for APM services are shows in the service details page.

When you select a specific span from the trace details, you can now see the related logs based on various attributes such as, traceId, pod, etc. You can see the metrics related to the service or endpoint.

Queries for rate (for counter type) and histogram quantile (for histograms) are much faster now.

Spinner is now displayed in the APM pages while columns values are not yet available.

Infrastructure

Infrastructure tab is now revamped with cleaner and slicker UI. In the related logs, you can now open logs details for any specific log line.

Platform

Query that run over the timeout or cause out of memory issues are terminated automatically. In addition, pods are restarted after hitting Out of Memory issue.

2.2.2

kfuse-2.2.2 introduces a key new feature - Error Analytics for Elastic APM.

New Features

User can now perform analytics on Elastic APM errors globally as well as see error types, frequency and last occurrences for a given service.

Logs

Each input JSON log line is sorted internally by key names before indexing. This improves the storage efficiency and search speed by reducing number of unique patterns detected in logs stream.

Distributed Tracing/APM

Slice and dicing on APM errors as well as errors chart/table in the service details page.

Fixed various discrepancies in calculating RED metrics from the incoming spans and transactions for Elastic APM data.

Metrics

Various performance improvements for auto-completion of metric names and labels as well as metric summarization.

2.1.0

kfuse-2.1.0 introduces two new features - Advanced Service Monitoring and TraceQL support.

New Features

ASM provides autonomic observability based on eBPF technology. kfuse-knight agent discovers and tracks all services and their interactions. ASM provides RED and USE metrics without any extra instrumentation or change to the application code. ASM also provides curated advanced alerts to detect anomalous and outliers behaviour in the services.

Spans can be queried now using TraceQL. In addition, you can view service map and flame-graph through Grafana.

Logs

Ability to skip auto extraction from JSON logs through logs parser configuration.

Charting was broken for grammar derived facets.

JSON message parsing has been optimized to reduce the CPU cost for logs parsing. This optimization is applicable to all log lines including structured JSON logs, embedded and partial JSON strings.

Distributed Tracing/APM

Span details now show stack-trace, local variables and context for Elastic APM. In addition to flame-graph spans are listed in a separate span list tab.

From the span details, user can now filter by (include/exclude) custom span attributes in addition to the standard OTel attributes.

Control plane

Various fixes for Control Plane dashboards and alerts.

2.0.0

kfuse-2.0.0 is a major release with the significant feature enhancements for our customers.

New Features

Filtering and navigation has been made uniform and more streamlined across all the streams including logs, traces, events and metrics.

Kloudfuse now has support for Service Level Objective. Users can set latency and availability SLOs for any service instrumented with distributed tracing.

Kloudfuse now has support for Single Sign On and support for various authorization methods including Google, Okta, Azure amongst others.

Kloudfuse alerting now supports Change, Outliers, Anomaly and Forecast alert types in addition to the threshold. Read more.

Kloudfuse catalog service enables support for migrating dashboards and alerts from external grafana to kfuse. Read more.

Logs

Logs parsing extracts, detects and color codes the data types of facets automatically to make it easier to work with large amount of logs data.

Logs parser pipeline stages can be configured through remap, relabel and transform actions/stages. This allows the users to configure and process logs data from any agent including fluent-bit, fluent-d, OTEL collector, DD-agent amongst others.

JSON message parsing has been optimized to reduce the CPU cost for logs parsing.. This optimizations apply not only to structured JSON logs but also to log lines containing embedded or partial JSON strings.

Distributed Tracing/APM

Apart from OTEL collector/format, Kloudfuse stack now supports Elastic APM and DD APM payload formats. The pipeline can be configured to drop/relabel various attributes as required.

Kloudfuse stack produces unified span derived metrics that can be configured to have arbitrary dimensions. If needed more span derived metrics can be produced by applying any filters and time/space aggregate to incoming data. The metric data can be retained independently of trace retention.

Users can download the full span data in two different formats; CSV format downloads only the columns shown in the UI whereas JSON format downloads all the attributes of incoming span that are stored by the stack.

Alerts

Kloudfuse alerting now supports Change, Outliers, Anomaly and Forecast alert types in addition to the threshold. Read more.

Infrastructure

The Infrastructure view to explore every Kubernetes resources now contains related logs, traces, metrics and events.

Control plane

Control plane now contains information on forecasted disk and outliers resource usage capabilities for the Kloudfuse observability stack resources.

1.3.4

kfuse-1.3.4 is a minor release update with the pointed bug fixes for Elastic APM ingestion.

1.3.3

kfuse-1.3.3 is a minor release update with the following feature enhancements for our customers.

Logs

Log data is sorted by multiple keys (fingerprints, labels, timestamp); this results in more efficient disk storage and as a result better query perf.

Log queries can be saved as views to be referenced later or as adhoc dashboards to be used by various team members.

Logs events can be downloaded from kfuse UI in three different formats.

  1. TXT: Raw log message as emitted by the application.

  2. CSV: Comma separated log msg along with all the fields that are shown in the kfuse logs UI screen.

  3. JSON: Full detailed log event with all the facets and labels associated with the log event.

The logs download is limited to 10MB.

Control Plane

This will be brought back in a later release with some enhancements.

1.3.2

kfuse-1.3.2 is a minor release update with performance improvements and couple of feature enhancements for our customers.

Metrics

On E2 machines metric segments used to take 4-10 minutes due to number of docs in each segment (~50M); We moved to columnar seal instead of row-by-row seal and seal times have come down by 50% or more.

Logs

kfuse stack can now ingest logs from Fluent-D directly. We support JSON and msg-pack format.

We now sort the log-lines on the disk based on their fingerprint. This results in better storage compression and improved grep and facet search performance.

Traces/APM

We now support flame-graph view for incomplete traces. In certain customer environments we may not get a root-span or parts of traces may always be missing due to environment setup. Improve flame-graph visualization to render such traces properly.

Moved to dictionary based encoding for span durations and bigger segments to improve query speeds.


1.3.1

kfuse-1.3.1 is a minor release update with support for better analytics.

Analytics

With the right instrumentation in place, Kloudfuse analytics can now do auto alerting and analysis. Auto alerting can be enabled easily for automatic monitoring of all Kubernetes services for anomalies on their RED metrics using Hawkeye. Auto analysis capability using Bullseye generates an analysis report with possible reasons for the alert (anomaly).


1.3.0

kfuse-1.3.0 release brings lots of improvements to the kfuse stack.

Metrics

Events

Logs

Traces/APM

Alerts

Infrastructure

Analytics