Assumptions
If using an existing dd-agent version (and just pointing it to KloudFuse instead of DD HQ)
dd agent version needs to be 7.41 or higher.
validate using following steps:
check the chart version with helm list - should be 3.1.10 or higher
helm list -n <namespace-where-agent-is-installed>
check the image version of the agent with describe pod on dd-agent pod.
k describe pod -n <namespace-where-agent-is-installed> | grep Image Image: gcr.io/datadoghq/agent:7.36.0
Install DataDog Agent
Agent Scenario \ Cluster Scenario | Kloudfuse stack & target both in same VPC and in same K8S cluster | Kloudfuse stack & target both in same VPC, but in different K8S cluster | Kloudfuse stack hosted in a different VPC (hosted at “customer.kloudfuse.io”) |
---|---|---|---|
Existing or new agent |
|
|
|
Required Datadog configurations
The yaml files described above already contains the required Datadog configurations for Kloudfuse to ingest the data properly. These required changes are listed below:
datadog: logsEnabled: true logs: enabled: true containerCollectAll: true containerCollectUsingFiles: true autoMultiLineDetection: true kubeStateMetricsEnabled: false kubeStateMetricsCore: enabled: true ignoreLegacyKSMCheck: true orchestratorExplorer: enabled: true processAgent: enabled: true prometheusScrape: enabled: true version: 1 additionalConfigs: - configurations: - send_monotonic_counter: false send_distribution_counts_as_monotonic: false send_distribution_sums_as_monotonic: false send_histograms_buckets: true max_returned_metrics: 999999 min_collection_interval: 15 clusterAgent: enabled: true datadog_cluster_yaml: process_config: container_collection: enabled: false orchestrator_explorer: manifest_collection: enabled: false use_v2_api: events: true series: true service_checks: true admissionController: enabled: false agents: image: tagSuffix: jmx useConfigMap: true customAgentConfig: skip_ssl_validation: false enable_stream_payload_serialization: false process_config: container_collection: enabled: false orchestrator_explorer: manifest_collection: enabled: false use_v2_api: events: true series: true service_checks: true logs_config: use_http: true logs_no_ssl: true auto_multi_line_detection: true use_v2_api: false apm_config: enabled: true apm_non_local_traffic: true metadata_providers: - name: host interval: 300
Collect high cardinality tags
The Datadog Agent is installed with cardinality set to ‘orchestrator’ level - allowing granular (pod and container level) tagging of metrics.. The default setting in the Datadog Agent is set to ‘low’ - allowing tagging only at host level.
Install command
If you haven’t before, add datadog helm repo:
helm repo add datadog https://helm.datadoghq.com helm repo update
Install the datadog agent with updated values (cmd below assumes the file is called dd-values-kfuse.yaml. Replace that argument with the filename that’s relevant to your scenario).
# Create separate namespace for the agent to be installed (if required) kubectl create namespace kfuse-agent helm upgrade --install kfuse-agent -f dd-values-kfuse.yaml datadog/datadog -n kfuse-agent --version 3.6.7
Adding custom tags
Custom tags can be added to the agent such that all metrics collected by the agent will have the tag added. To add custom tags, please follow these steps.
Update
dd-values-kfuse.yaml
file to include following for the custom tags:
datadog: tags: - custom_tag_name: custom_tag_value
custom-values.yaml
file needs to be updated to whitelist the new custom tag (note that the following entry needs to be “appended” to the existing list, without which some default values will get overwritten which is not the desired behavior. Please contact the Kloudfuse team for assistance):
ingester: config: hostTagIncludes: - kfuse - kube_cluster_name - kubernetes.io/hostname - node.kubernetes.io/instance-type - org_id - project - topology.kubernetes.io/region - topology.kubernetes.io/zone - custom_tag_name
Advance monitoring: Enable kubernetes_state_metrics
Advance monitoring (Kfuse 1.3 or higher) currently has dependency on kubernetes_state_metrics (KSM) check which is not enabled in the newer version of the agent (2.0) by default. Please ensure that the agent continues to capture these metrics through KSM. To do that, please add/update the dd-agent values file as follows:
datadog: kubeStateMetricsEnabled: true kubeStateMetricsCore: enabled: true ignoreLegacyKSMCheck: false