Story image

Amazon CloudWatch adds custom metrics support

01 Oct 18

Amazon CloudWatch Agent now supports the ability to publish custom StatsD or collectd metrics to CloudWatch. 

Businesses can leverage these custom metrics to create alarms for triggering notifications and auto-scaling actions or save them to dashboards for quick viewing in CloudWatch. 

StatsD and collectd are popular, open-source solutions that gather system statistics for a wide variety of applications. CloudWatch Agent enables companies to publish and store custom StatsD and collectd metrics for up to 15 months in CloudWatch.

Businesses can also choose to publish these custom metrics to an account other than the resource account where the agent is collecting metrics, such as a central monitoring account.

They can get started with the CloudWatch agent by downloading directly from the AWS SSM console or via CLI from our S3 bucket for standalone installs. To learn more, please visit the CloudWatch agent user guide for StatsD and collectd. 

The CloudWatch agent is available in all AWS public regions, including AWS GovCloud. 

Collectd is a daemon which collects system and application performance metrics periodically and provides mechanisms to store the values in a variety of ways, for example in RRD files.

Collectd gathers metrics from various sources, for example, the operating system, applications, log files and external devices, and stores this information or makes it available over the network. 

Those statistics can be used to monitor systems, find performance bottlenecks (i.e. performance analysis) and predict future system load (i.e. capacity planning).

StatsD is a network daemon that runs on the Node.js platform and listens for statistics, like counters and timers, sent over UDP or TCP and sends aggregates to one or more pluggable backend services (e.g. Graphite).

StatsD was inspired by the project (of the same name) at Flickr.

StatsD Overview: 

1. Each stat is in its own "bucket". They are not predefined anywhere. Buckets can be named anything that will translate to Graphite.

2. Each stat will have a value. How it is interpreted depends on modifiers. In general, values should be an integer.

3. After the flush interval timeout (defined by config.flushInterval, default 10 seconds), stats are aggregated and sent to an upstream backend service.

Will 2019 be the year of network evolution?
An A10 Networks exec talks 5G, software-defined networks, and the continuing evolution needed for a modern cloud environment.
ZTE takes the lead in the global race to 5G
ZTE took the lead in completing the IMT-2020 third phase 5G test for core network performance stability and security function.
IDC: Relevance is combining strategy, creativity and IT services
IDC reveals the Top 10 Asia/Pacific predictions to impact IT and business services sourcing in 2019 and beyond.
How IIoT is creating opportunities for RFID companies
The growing demands for automation and digitisation are creating considerable growth opportunities for RFID vendors.
Huawei founder publically denies spying allegations
“After all the evidence is made public, we will rely on the justice system.”
Malware downloader on the rise in Check Point’s latest Threat Index
Organisations continue to be targeted by cryptominers, despite an overall drop in value across all cryptocurrencies in 2018.
Exclusive: Why Australia’s IT industry needs to invest in SMBs
"With SMBs generating employment for over five million Australians, it comes as no surprise that they play a vital role in the nation’s economy."
IoT breaches: Nearly half of businesses still can’t detect them
The Internet of Thing’s (IoT’s) rapid rise to prominence may have compromised its security, if a new report from Gemalto is anything to go by.