netdata v1.18.0 Release Notes

Release Date: 2019-10-10 // over 4 years ago
  • Netdata v1.18.0

    πŸ“š Release v1.18.0 contains 5 new collectors, 19 bug fixes, 28 improvements, and 20 documentation updates.

    At a glance

    The database engine is now the default method of storing metrics in Netdata. You immediately get more efficient and configurable long-term metrics storage without any work on your part. By saving recent metrics in RAM and "spilling" historical metrics to disk for long-term storage, the database engine is laying the foundation for many more improvements to distributed metrics.

    πŸ“„ We even have a tutorial on switching to the database engine and getting the most from it. Or, just read up on how performant the database engine really is.

    Both our python.d and go.d plugins now have more intelligent auto-detection by periodically dump a list of active modules to disk. When Netdata starts, such as after a reboot, the plugins use this list of known services to re-establish metrics collection much more reliably. No more worrying if the service or application you need to monitor starts up minutes after Netdata.

    Two of our new collectors will help those with Hadoop big data infrastructures. The HDFS and Zookeeper collection modules come with essential alarms requested by our community and Netdata's auto-detection capabilities to keep the required configuration to an absolute minimum. Read up on the process via our HDFS and Zookeeper tutorial.

    Speaking of new collectorsβ€”we also added the ability to collect metrics from SLAB cache, Gearman, and vCenter Server Appliances.

    Before v1.18, if you wanted to create alarms for each dimension in a single chart, you need to write separate entities for each dimensionβ€”not very efficient or user-friendly. New dimension templates fix that hassle. Now, a single entity can automatically generate alarms for any number of dimensions in a chart, even those you weren't aware of! Our tutorial on dimension templates has all the details.

    πŸ“š v1.18 brings support for installing Netdata on offline or air-gapped systems. To help users comply with strict security policies, our installation scripts can now install Netdata using previously-downloaded tarball and checksums instead of downloading them at runtime. We have guides for installing offline via kickstart.sh or kickstart-static64.sh in our installation documentation. We're excited to bring real-time monitoring to once-inaccessible systems!

    Acknowledgements

    Our thanks go to:

    πŸ‘Œ Improvements

    Database engine

    • 0️⃣ Make dbengine the default memory mode #6977 (mfundul)
    • 0️⃣ Increase dbengine default cache size #6997 (mfundul)
    • ⬇️ Reduce overhead during write IO #6964 (mfundul)
    • Detect deadlock in dbengine page cache #6911 (mfundul)
    • βœ‚ Remove hard cap from page cache size to eliminate deadlocks. #7006 (mfundul)

    πŸ†• New Collectors

    Collector improvements

    Health

    • Enable alarm templates for chart dimensions #6560 (thiagoftsm)
    • Center the chart on the proper chart and time whenever an alarm link is clicked #6391 (thiagoftsm)

    πŸ“¦ Installation/Packages

    πŸ“š Documentation

    • βž• Add Portuguese (Brazil) translation of the installation instructions #16(lets00), #7004 (cakrit)
    • πŸ›  Fix broken links found via linkchecker #6983 (joelhans)
    • πŸ”§ Clarification on configuring notification recipients #6961 (cakrit)
    • πŸ›  Fix Remark Lint for READMEs in database #6942, contrib #6921, daemon README #6920 and backends #6917 (prhomhyse)
    • Suggest using /run or /var/run for the unix socket #6916 (cakrit)
    • πŸ‘Œ Improve documentation for the SNMP collector #6915 (cakrit)
    • ⚑️ Update docs for offline install #6884 (paulkatsoulakis)
    • βœ‚ Remove Dollar sign from Bash code in documentation and fix remark-lint warnings #6880 (prhomhyse)
    • πŸ“œ Markdown syntax fixes for MDX parser #6877 (joelhans)
    • ⚑️ Update python.d module checklist to match the current paths and build system. #6874 (Ferroin)
    • βž• Add instructions for simple SMTP transport #6870 (cakrit)
    • βž• Add example for prometheus archiving source parameter #6869 (cakrit)
    • πŸ›  Fix broken links in the standard web dashboard doc #6854 (prhomhyse)
    • Overhaul of Getting started guide #6811 (joelhans)
    • ⚑️ NPM Packages version update #6801 (prhomhyse)
    • 🐎 Update suggested grep command in β€œhigh performance netdata” to be more specific #6794 (n0coast)

    Other

    • API: Include family into the allmetrics JSON response #6966 (ilyam8)
    • πŸ›  API: Add fixed width option to badges #6903 (underhood)
    • πŸ‘ Allow hostnames in Access Control Lists #6796 (amoss)
    • βœ… Functional test improvements for web and alarms tests #6783 (thiagoftsm)

    πŸ› Bug fixes

    • πŸ›  Fix issue error in alarm notification script, when executed without any arguments #7003 (OneCodeMonkey)
    • πŸ›  Fix Coverity warnings #6992 #6970#6941#6797 (thiagoftsm), #6909 (cakrit)
    • πŸ›  Fix dbengine consistency when a writer modifies a page concurrently with a reader querying its metrics #6979 (mfundul)
    • πŸ›  Fix memory leak on netdata exit #6945 (vlvkobal)
    • πŸ›  Fix for missing boundary data points in certain cases #6938 (mfundul)
    • πŸ›  Fix unhandled exception log warnings in the python.d collector orchestrator start\_job #6928 (ilyam8)
    • πŸ›  Fix CORS errors when accessing the health management API, by permitingt x-auth-token in Access-Control-Allow-Headers #6894 (cakrit)
    • πŸ›  Fix misleading error log entries RRDSET: chart name 'XXX' on host 'YYY' already exists, by changing the log level for chart updates #6887 (vlvkobal)
    • Properly resolve all Kubernetes container names #6885 (cakrit)
    • πŸ›  Fix LGTM warnings #6875 (jacekkolasa)
    • πŸ›  Fix agent UI redirect loop during cloud sign-in #6868 (jacekkolasa)
    • πŸ›  Fix /var/lib/netdata/registry getting left behind after uninstall #6867 (davent)
    • πŸ›  Fix python.d.plugin bug in parsing configuration files with no explicitly defined jobs #6856 (ilyam8)
    • πŸ›  Fix potential buffer overflow in the web server #6817 (amoss)
    • πŸ›  Fix netdata group deletion on linux for uninstall script #6645 (mbarper)
    • πŸ›  Various cppcheck fixes #6386 (ac000)
    • Fix crash on FreeBSD due to do_dev_cpu_temperature stack corruption #7014 (samm-git)
    • πŸ›  Fix handling of illegal metric timestamps in database engine #7008 (mfundul)
    • πŸ›  Fix a resource leak #7007 (vlvkobal)
    • πŸ›  Fix rabbitmq collector error when no vhosts are available. #7018 (mfundul)