- do not initialize libipmctl package when getting an error from nvm_init()
- Don't fail permenantly when nvml isn't installed
- Update libpfm to 4.11.0
- Fix race between
OnDemandHousekeeping
andhousekeepingTick
- Fix timeout flooding issue after containerd restart
- Refactor process parsing to accommodate commands with spaces + Memory cgroup is not available on some systems
- Switch from k8s utils/mount to moby/sys mount
- Support nfs in processMounts
- Update docker/runc and a few other dependencies
- Add container_blkio_device_usage metric
- Update heuristic for container creation time
- Fix incorrect CPU topology on single NUMA and multi socket platform.
- Added support for filesystem metrics on Docker
- sched_getaffinity does not return number of online CPUs
- Add libipmctl to the docker image.
- Add cgroup_memory_migrate metric
- bump runc to v1.0.0-rc93
- Fix memory stats for cgroup v2
- Allow gathering of stats for root cgroup on v2
- Remove trailing \0 from values read from ppc64le device-tree
- Fix oomparser regex for kernels 5.0 and higher
- Handling arm64: topology and online information
- Bump golang to 1.16
- Bump containerd to 1.4.4
- Conditionally gathering FS usage metrics
- Cherrypick to v0.38 - Fix incorrect CPU topology on single NUMA and multi socket platform
- Cherrypick to v0.38 - sched_getaffinity does not return number of online CPUs
- Cherrypick to v0.37 - Fix incorrect CPU topology on single NUMA and multi socket platform
- Cherrypick to v0.37 - sched_getaffinity does not return number of online CPUs
- deps: Rollback grpc from v1.33.2 to v1.27.1
- do not initialize libipmctl package when getting an error from nvm_init()
- #1594 - chore: add storage_driver_buffer_duration in Influxdb storage docs
- #1924 - add hugepages info to attributes
- #2578 - Add perf event grouping.
- #2590 - Use current Docker registry
- #2611 - Aggregate perf metrics
- #2612 - Add stats to stdout storage
- #2618 - Update to containerd v1.4.0-beta.2 and runc v1.0.0-rc91
- #2621 - Memory numa stats
- #2627 - use Google Charts loader and not jsapi
- #2631 - Add entry for libpfm related tests to Makefile
- #2632 - Handling zeros in readPerfStat
- #2638 - Add stats to statsd storage
- #2639 - Add logs and simplify setup of raw perf events
- #2640 - Remove exclude guest flag from perf event attrs.
- #2644 - Use perf attributes from unix lib.
- #2646 - Fixed https proxy issue by installing 'full' wget in Docker alpine-based build stage
- #2655 - Update readme to point to discuss.kubernetes.io
- #2659 - Fix ordering of processes table
- #2665 - add clean operation when watchForNewContainers/Start failed
- #2669 - Update release documentation and process
- #2676 - Fix runtime error when there are no NVM devices.
- #2678 - Add checking checksum of libpfm4
- #2679 - Fix typo in libipmctl
- #2682 - Add missing flag to runtime_options.md
- #2683 - Add flags that were not previously published
- #2687 - Move mount library dependency from utils/mount to mount-utils
- #2689 - Increase the readability of perf event logs.
- #2690 - Try to read from sysfs before giving up on non-x86_64
- #2691 - Broken build configuration when custom build tags are used
- #2695 - Add information about limits of opened perf event files.
- #2697 - Update to new docker(v19.03.13) and containerd(1.4.1)
- #2702 - Increase golang ci lint timeout to 5 minutes
- #2706 - Add a badge for the current e2e test result
- #2707 - Fix Avoid random values in unix.PerfEventAttr{}
- #2711 - validateMemoryAccounting: fix for cgroup v2
- #2713 - Bump golang to 1.15
- #2714 - update docker client method
- #2716 - Update dependencies
- Add on-demand collection for prometheus metrics
- Fix detection of image filesystem
- Fix disk metrics for devicemapper devices
- Add NVM Power and NVM, Dimm, memory information to machine info
- Fix detection of OOM Kills on 5.0 linux kernels
- Add support for perf core and uncore event monitoring
- Add hugetlb container metrics
- Split into multiple go modules
- Add referenced memory metrics
- Publish images to gcr.io/cadvisor instead of gcr.io/google_containers
- Add socket id to numa topology in machine info
- Add resource control (Resctlr) metrics
- Add support for risc and mips CPUs
- Add advanced TCP stats
- Fix bug in which cAdvisor could fail to discover docker's root directory
- The stdout storage driver now supports metric timestamps
- Add ulimit metrics
- Support multi-arch container builds
- Switch to go modules
- Add hugepage info per-numa-node
- Add support for cgoups v2 unified higherarchy
- Drop support for rkt
- Fix a bug that prevented running with multiple tmpfs mounts
- Fix disk stats in LXD using ZFS storage pool
- Support monitoring non-k8s containerd namespaces
- The
storage_driver
flag now supports comma-separated inputs - Add
container_sockets
,container_threads
, andcontainer_threads_max
metrics - Fix CRI-O missing network metris bug
- Add
disable_root_cgroup_stats
flag to allow not collecting stats from the root cgroup.
- Add --raw_cgroup_prefix_whitelist flag to allow configuring which raw cgroup trees cAdvisor monitors
- Replace
du
andfind
with a golang implementation - Periodically update MachineInfo to support hot-add/remove
- Add explicit timestamps to prometheus metrics to fix rate calculations
- Add --url_base_prefix flag to provide better support for reverse proxies
- Add --white_listed_container_labels flag to allow specifying the container labels added as prometheus labels
- Add container process and file descriptor metrics (disabled by default)
- Rename
type
label tofailure_type
for prometheusmemory_failures_total
metric - Reduce mesos error logging when mesos not present
- Fix NVML initialization race condition
- Fix brtfs filesystem discovery
- Fix race condition with AllDockerContainers
- Don't watch .mount cgroups
- Reduce lock contention during list containers
- Don't produce prometheus metrics for ignored metrics
- Add option to not export container labels as prometheus labels
- Docs: Publish cAdvisor daemonset
- Docs: Add documentation for exported prometheus metrics
- Revert switch from inotify to fsnotify
- Use IONice to reduce IO priority of
du
andfind
- BREAKING API CHANGE: ContainerReference no longer contains Labels. Use ContainerSpec instead.
- Add schedstat metrics, disabled by default.
- Fix a bug where cadvisor failed to discover a sub-cgroup that was created soon after the parent cgroup.
- Disable per-cpu metrics by default for scalability
- Fix disk usage monitoring of overlayFs
- Retry docker connection on startup timeout
- Add timeout for docker calls
- Fix prometheus label consistency
- Fix GPU init race condition
- Add containerd support
- Fix fsnotify regression from 0.28.0
- Add on demand metrics
- Add container nvidia GPU metrics
- Expose container memory max_usage_in_bytes
- Add container memory reservation to prometheus
- Add CRI-O support
- Fix journalctl leak
- Fix container memory rss
- Add hugepages support
- Fix incorrect CPU usage with 4.7 kernel
- OOM parser uses kmsg
- Add tmpfs support
- Fix prometheus metrics.
- Fix disk partition discovery for brtfs
- Add ZFS support
- Add UDP metrics (collection disabled by default)
- Improve diskio prometheus metrics
- Update Prometheus godeps to v0.8
- Add overlay2 storage driver support
- Disable thin_ls due to excessive iops
- Ignore .mount cgroups, fixing dissappearing stats
- Fix wc goroutine leak
- Update aws-sdk-go dependency to 1.6.10
- Update to go 1.7 for releases
- Fix issue with running cAdvisor in a container on some distributions.
- Added host-level inode stats (total & available)
- Improved robustness to partial failures
- Metrics collector improvements
- Added ability to directly use endpoints from the container itself
- Allow SSL endpoint access
- Ability to provide a certificate which is exposed to custom endpoints
- Lots of bug fixes, including:
- Devicemapper thin_ls fixes
- Prometheus metrics fixes
- Fixes for missing stats (memory reservation, FS usage, etc.)
- Cherry-pick release:
- Ensure minimum kernel version for thin_ls
- Cherry-pick release:
- Prefix Docker labels & env vars in Prometheus metrics to prevent conflicts
- Cherry-pick release:
- Modify working set memory stats calculation
- Cherry-pick release:
- Updating inotify to fix memory leak v0.23 cherrypick
- Cherry-pick release:
- support LVM based device mapper storage drivers
- Cherry-pick release:
- Check for thin_is binary in path for devicemapper when using ThinPoolWatcher
- Fix uint64 overflow issue for CPU stats
- Cherry-pick release:
- Cap the maximum consecutive du commands
- Fix a panic when a prometheus endpoint ends with a newline
- Handle kernel log rotation
- More rkt support: poll rkt service for new containers
- Better handling of partial failures when fetching subcontainers
- Devicemapper thin_ls support (requires Device Mapper kernel module and supporting utilities)
- Add multi-container charts to the UI
- Add TLS options for Kafka storage driver
- Switch to official Docker client
- Systemd:
- Ignore .mount cgroups on systemd
- Better OOM monitoring
- Bug: Fix broken -disable_metrics flag
- Bug: Fix openstack identified as AWS
- Bug: Fix EventStore when limit is 0
- Docker v1.11 support
- Preliminary rkt support
- Bug: Fix file descriptor leak
- Disk usage calculation bug fixes
- Systemd integration bug fixes
- Instance ID support for Azure and AWS
- Limit number of custom metrics
- Support opt out for disk and network metrics
- Support for filesystem stats with docker v1.10
- Bug fixes.
- Breaking: Use uint64 for memory stats
- Bug: Fix devicemapper partition labelling
- Bug: Fix network stats when using new Docker network functionality
- Bug: Fix env var label mapping initialization
- Dependencies: libcontainer update
- Godep updates
- Bug fixes
- Jitter added to housekeeping to smooth CPU usage.
- New v2.1 API with better filesystem stats
- Internal refactoring
- Bug fixes.
- Large bunch of bug-fixes
- Fixed networking stats for newer docker versions using libnetwork.
- Added application-specific metrics
- Misc fixes.
- Fix longstanding memory leak.
- Fix UI on newest Chrome.
- Expose multiple network intefaces in UI and API.
- Add support for XFS.
- Fixes in inotify watches.
- Fixes on PowerPC machines.
- Fixes for newer systems with systemd.
- Extra debuging informaiton in /validate.
- Add process stats to container pages in the UI.
- Serve UI from relative paths (allows reverse proxying).
- Minor fixes to events API.
- Add bytes available to FS info.
- Adding Docker status and image information to UI.
- Basic Redis storage backend.
- Misc reliability improvements.
- Added
--docker_only
to limit monitoring to only Docker containers. - Added support for Docker labels.
- Added limit for events storage.
- Fixes for OOM event monitoring.
- Changed event type to a string in the API.
- Misc fixes.
- Added support for Docker 1.6.
- Split OOM event into OOM kill and OOM.
- Made EventData a concrete type in returned events.
- Enabled CPU load tracking (experimental).
- Export all stats as Prometheus metrics.
- Initial support for events: creation, deletion, and OOM.
- Adding machine UUID information.
- Beta release of the cAdvisor 2.0 API.
- Improve handling of error conditions.
- Misc fixes and improvements.
- Disable OOM monitoring which is using too much CPU.
- Fix break in summary stats.
- Adding Start and End time for ContainerInfoRequest.
- Various misc fixes.
- Support for more network devices (all non-eth).
- Support for more partition types (btrfs, device-mapper, whole-disk).
- Added reporting of DiskIO stats.
- Adding container creation time to ContainerSpec.
- More robust handling of stats failures.
- Various misc fixes.
- Added ethernet device information.
- Added machine-wide networking statistics.
- Misc UI fixes.
- Fixes for partially-isolated containers.
- Avoid repeated logging of container errors.
- Handle non identify mounts for cgroups.
- Support for HTTP basic auth.
- Added /validate to perform basic checks and determine support for cAdvisor.
- All stats in the UI are now updated.
- Added gauges for filesystem usage.
- Added device information to machine info.
- Fixes to container detection.
- Fixes for systemd detection.
- ContainerSpecs are now cached.
- Performance improvements.
- Fixes for Docker API and UI endpoints.
- Misc UI bugfixes.
- Bug fix in InfluxDB storage driver. Container name and hostname will be exported.
- Adding /docker UI endpoint for Docker containers.
- Fixes around handling Docker containers.
- Performance enhancements.
- Embed all external dependencies.
- ContainerStats Go struct has been flattened. The wire format remains unchanged.
- Misc bugfixes and cleanups.
- Added disk space stats. On by default for root, available on AUFS Docker containers.
- Introduced v1.2 remote API with new "docker" resource for Docker containers.
- Added "ContainerHints" file based interface to inject extra information about containers.
- Support for Docker containers in systemd systems.
- Adding DiskIO stats
- Misc bugfixes and cleanups
- Various performance enhancements: brings CPU usage down 85%+
- Implemented dynamic sampling through dynamic housekeeping.
- Memory storage driver is always on, BigQuery and InfluxDB are now optional storage backends.
- Fix for DNS resolution crashes when contacting InfluxDB.
- New containers are now detected using inotify.
- Added pprof HTTP endpoint.
- UI bugfixes.
- Support for Docker with LXC backend.
- Added BigQuery storage driver.
- Performance and stability fixes for InfluxDB storage driver.
- UI fixes and improvements.
- Configurable stats gathering interval (default: 1s).
- Improvements to startup and CPU utilization.
- Added /healthz endpoint for determining whether cAdvisor is healthy.
- Bugfixes and performance improvements.
- Improvements to influxDB plugin. Table name is now 'stats'. Network stats added. Detailed cpu and memory stats are no longer exported to reduce the load on the DB. Docker container alias now exported - It is now possible to aggregate stats across multiple nodes.
- Make the UI independent of the storage backend by caching recent stats in memory.
- Switched to glog.
- Bugfixes and performance improvements.
- Introduced v1.1 remote API with new "subcontainers" resource.
- Handle old Docker versions.
- UI fixes and other bugfixes.
- Added network stats to the UI.
- Added support for CoreOS and RHEL.
- Bugfixes and reliability fixes.
- Add network statistics to REST API.
- Add "raw" driver to handle non-Docker containers.
- Remove lmctfy in favor of the raw driver.
- Bugfixes for Docker containers and logging.
- Add support for systemd systems.
- Fixes for UI with InfluxDB storage driver.
- Added Storage Driver concept (flag: storage_driver), default is the in-memory driver
- Implemented InfluxDB storage driver
- Support in REST API for specifying number of stats to return
- Allow running without lmctfy (flag: allow_lmctfy)
- Bugfixes
- Support for container aliases
- Sampling historical usage and exporting that in the REST API
- Bugfixes for UI
- Initial version of cAdvisor
- Web UI with auto-updating stats
- v1.0 REST API with container and machine information
- Support for Docker containers
- Support for lmctfy containers