diff --git a/README.md b/README.md index d2c3fe678..a756febea 100644 --- a/README.md +++ b/README.md @@ -4,101 +4,76 @@

-GitHub latest release - + Docs Docker pulls + + GitHub contributors GitHub Repo stars -GitHub Repo issues +
GitHub Repo issues GitHub Repo issues closed GitHub forks - - GitHub contributors +GitHub latest release +License GitHub contributors -License

+

- An open-source cloud-native monitoring system that is all-in-one
- Out-of-the-box, it integrates data collection, visualization, and monitoring alert
- We recommend upgrading your Prometheus + AlertManager + Grafana combination to Nightingale! + An alert management expert and an integrated open-source observability platform

-[English](./README.md) | [中文](./README_zh.md) - +[English](./README_en.md) | [中文](./README.md) -## Highlighted Features +Nightingale (夜莺) is the first open-source project to be donated to and hosted by the China Computer Federation (CCF). It is an all-in-one cloud-native monitoring tool that combines the strengths of Prometheus and Grafana: you can manage and configure alert rules in the web UI, and you can visualize and analyze metrics, logs, and traces spread across multiple regions in one place. Nightingale incorporates observability best practices from leading Internet companies and the accumulated experience of many community experts, and it works out of the box. [Learn more](https://flashcat.cloud/product/nightingale/) -- **Out-of-the-box** - - Supports multiple deployment methods such as **Docker, Helm Chart, and cloud services**, integrates data collection, monitoring, and alerting into one system, and comes with various monitoring dashboards, quick views, and alert rule templates. **It greatly reduces the construction cost, learning cost, and usage cost of cloud-native monitoring systems**. -- **Professional Alerting** - - Provides visual alert configuration and management, supports various alert rules, offers the ability to configure silence and subscription rules, supports multiple alert delivery channels, and has features such as alert self-healing and event management. -- **Cloud-Native** - - Quickly builds an enterprise-level cloud-native monitoring system through a turnkey approach, supports multiple collectors such as [Categraf](https://github.com/flashcatcloud/categraf), Telegraf, and Grafana-agent, supports multiple data sources such as Prometheus, VictoriaMetrics, M3DB, ElasticSearch, and Jaeger, and is compatible with importing Grafana dashboards. **It seamlessly integrates with the cloud-native ecosystem**. -- **High Performance and High Availability** - - Due to the multi-data-source management engine of Nightingale and its excellent architecture design, and utilizing a high-performance time-series database, it can handle data collection, storage, and alert analysis scenarios with billions of time-series data, saving a lot of costs. - - Nightingale components can be horizontally scaled with no single point of failure. 
It has been deployed in thousands of enterprises and tested in harsh production practices. Many leading Internet companies have used Nightingale for cluster machines with hundreds of nodes, processing billions of time-series data. -- **Flexible Extension and Centralized Management** - - Nightingale can be deployed on a 1-core 1G cloud host, deployed in a cluster of hundreds of machines, or run in Kubernetes. Time-series databases, alert engines, and other components can also be decentralized to various data centers and regions, balancing edge deployment with centralized management. **It solves the problem of data fragmentation and lack of unified views**. +## Resources -#### If you are using Prometheus and have one or more of the following requirement scenarios, it is recommended that you upgrade to Nightingale: +- Docs: [flashcat.cloud/docs](https://flashcat.cloud/docs/) +- Questions: [answer.flashcat.cloud](https://answer.flashcat.cloud/) +- Bug reports: [github.com/ccfos/nightingale/issues](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Fbug&projects=&template=bug_report.yml) -- Multiple systems such as Prometheus, Alertmanager, Grafana, etc. 
are fragmented and lack a unified view and cannot be used out of the box; -- Managing Prometheus and Alertmanager by editing configuration files has a steep learning curve and makes collaboration difficult; -- You have too much data to handle by scaling up your Prometheus cluster; -- Multiple Prometheus clusters run in production environments, with high management and usage costs; -#### If you are using Zabbix and have the following scenarios, it is recommended that you upgrade to Nightingale: +## Features -- Monitoring too much data and wanting a more scalable solution; -- A steep learning curve and a desire for better efficiency of collaborative use in a multi-person, multi-team model; -- Microservice and cloud-native architectures with variable monitoring data lifecycles and high-cardinality monitoring data dimensions, which are not easily adaptable to the Zabbix data model; +- **Unified access to time-series databases**: Connects to Prometheus, VictoriaMetrics, Thanos, Mimir, M3DB, and other time-series databases for unified alert management +- **Professional alerting**: Built-in support for multiple alert rule types, extensible to any notification channel, with alert muting, alert inhibition, alert self-healing, and alert event management +- **High-performance visualization engine**: Supports many chart styles, ships with numerous built-in dashboard templates, can also import Grafana templates, works out of the box, and uses a business-friendly open-source license +- **Seamless pairing with [Flashduty](https://flashcat.cloud/product/flashcat-duty/)**: Provides alert aggregation and noise reduction, acknowledgment, escalation, on-call scheduling, and IM integration, so that no alert goes unhandled, interruptions are reduced, and teams collaborate better +- **Support for all common collectors**: Supports [Categraf](https://flashcat.cloud/product/categraf), Telegraf, Grafana-agent, Datadog-agent, and various exporters as collectors, so there is no data that cannot be monitored +- **Integrated observability platform**: Starting with v6, supports ElasticSearch and Jaeger data sources for unified observability across logs, traces, and metrics -#### If you are using [open-falcon](https://github.com/open-falcon/falcon-plus), we recommend that you upgrade to Nightingale: -- For more information about open-falcon and Nightingale, please refer to [Ten features and trends of cloud-native monitoring](https://mp.weixin.qq.com/s?__biz=MzkzNjI5OTM5Nw==&mid=2247483738&idx=1&sn=e8bdbb974a2cd003c1abcc2b5405dd18&chksm=c2a19fb0f5d616a63185cd79277a79a6b80118ef2185890d0683d2bb20451bd9303c78d083c5#rd). +## Product Demo -## Getting Started +![Demo](doc/img/n9e-screenshot-gif-v6.gif) -[https://n9e.github.io/](https://n9e.github.io/) 
+## Deployment Architecture -## Screenshots +![Architecture](doc/img/n9e-arch-latest.png) -https://user-images.githubusercontent.com/792850/216888712-2565fcea-9df5-47bd-a49e-d60af9bd76e8.mp4 +## Community Groups -## Architecture - - - -Nightingale monitoring can receive monitoring data reported by various collectors (such as [Categraf](https://github.com/flashcatcloud/categraf) , telegraf, grafana-agent, Prometheus, etc.) and write them to various popular time-series databases (such as Prometheus, M3DB, VictoriaMetrics, Thanos, TDEngine, etc.). It provides configuration capabilities for alert rules, silence rules, and subscription rules, as well as the ability to view monitoring data. It also provides automatic alarm self-healing mechanisms (such as automatically calling back to a webhook address or executing a script after an alarm is triggered), and the ability to store and manage historical alarm events and view them in groups. - -If the performance of a standalone time-series database (such as Prometheus) has bottlenecks or poor disaster recovery, we recommend using [VictoriaMetrics](https://github.com/VictoriaMetrics/VictoriaMetrics). The VictoriaMetrics architecture is relatively simple, has excellent performance, and is easy to deploy and maintain. The architecture diagram is as shown above. For more detailed documentation on VictoriaMetrics, please refer to its [official website](https://victoriametrics.com/). 
- -**We welcome you to participate in the Nightingale open-source project and community in various ways, including but not limited to**: -- Adding and improving documentation => [n9e.github.io](https://n9e.github.io/) -- Sharing your best practices and experience in using Nightingale monitoring => [Article sharing](https://n9e.github.io/docs/prologue/share/) -- Submitting product suggestions => [github issue](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Ffeature&template=enhancement.md) -- Submitting code to make Nightingale monitoring faster, more stable, and easier to use => [github pull request](https://github.com/didi/nightingale/pulls) - - -**Respecting, recognizing, and recording the work of every contributor** is the first guiding principle of the Nightingale open-source community. We advocate effective questioning, which not only respects the developer's time but also contributes to the accumulation of knowledge in the entire community. -- Before asking a question, please first refer to the [FAQ](https://www.gitlink.org.cn/ccfos/nightingale/wiki/faq) -- We use [GitHub Discussions](https://github.com/ccfos/nightingale/discussions) as the communication forum. You can search and ask questions here. -- We also recommend that you join our [Slack channel](https://n9e-talk.slack.com/) to exchange experiences with other Nightingale users. +1. For questions and discussion, we recommend the [Nightingale Answer forum](https://answer.flashcat.cloud/) first; +2. To report a bug, we recommend opening a [Nightingale GitHub issue](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Fbug&projects=&template=bug_report.yml); +3. Browse the [Nightingale documentation site](https://flashcat.cloud/docs/) to learn more; +4. Search for and follow the Nightingale WeChat official account: **夜莺监控Nightingale**; +5. You are welcome to join the QQ group (group number: 479290895), where members help one another. +## Stargazers over time -## Who is using Nightingale -You can register your usage and share your experience by posting on **[Who is Using Nightingale](https://github.com/ccfos/nightingale/issues/897)**. 
+[![Stargazers over time](https://api.star-history.com/svg?repos=ccfos/nightingale&type=Date)](https://star-history.com/#ccfos/nightingale&Date) -## Stargazers over time -[![Stargazers over time](https://starchart.cc/ccfos/nightingale.svg)](https://starchart.cc/ccfos/nightingale) ## Contributors +## Community Governance +[Nightingale open-source project and community governance structure (draft)](./doc/community-governance.md) + ## License -[Apache License V2.0](https://github.com/didi/nightingale/blob/main/LICENSE) \ No newline at end of file +[Apache License V2.0](https://github.com/didi/nightingale/blob/main/LICENSE) diff --git a/README_en.md b/README_en.md new file mode 100644 index 000000000..6e446c3b7 --- /dev/null +++ b/README_en.md @@ -0,0 +1,104 @@ +

+ + nightingale - cloud native monitoring +

+ +

+GitHub latest release + + Docs + + Docker pulls +GitHub Repo stars +GitHub Repo issues +GitHub Repo issues closed +GitHub forks + + GitHub contributors + + GitHub contributors +License +

+

+ An open-source cloud-native monitoring system that is all-in-one
+ Out-of-the-box, it integrates data collection, visualization, and monitoring alert
+ We recommend upgrading your Prometheus + AlertManager + Grafana combination to Nightingale! +

+ +[English](./README_en.md) | [中文](./README.md) + + +## Highlighted Features + +- **Out-of-the-box** + - Supports multiple deployment methods such as **Docker, Helm Chart, and cloud services**, integrates data collection, monitoring, and alerting into one system, and comes with various monitoring dashboards, quick views, and alert rule templates. **It greatly reduces the construction cost, learning cost, and usage cost of cloud-native monitoring systems**. +- **Professional Alerting** + - Provides visual alert configuration and management, supports various alert rules, offers the ability to configure silence and subscription rules, supports multiple alert delivery channels, and has features such as alert self-healing and event management. +- **Cloud-Native** + - Quickly builds an enterprise-level cloud-native monitoring system through a turnkey approach, supports multiple collectors such as [Categraf](https://github.com/flashcatcloud/categraf), Telegraf, and Grafana-agent, supports multiple data sources such as Prometheus, VictoriaMetrics, M3DB, ElasticSearch, and Jaeger, and is compatible with importing Grafana dashboards. **It seamlessly integrates with the cloud-native ecosystem**. +- **High Performance and High Availability** + - Due to the multi-data-source management engine of Nightingale and its excellent architecture design, and utilizing a high-performance time-series database, it can handle data collection, storage, and alert analysis scenarios with billions of time-series data, saving a lot of costs. + - Nightingale components can be horizontally scaled with no single point of failure. It has been deployed in thousands of enterprises and tested in harsh production practices. Many leading Internet companies have used Nightingale for cluster machines with hundreds of nodes, processing billions of time-series data. 
+- **Flexible Extension and Centralized Management** + - Nightingale can be deployed on a 1-core 1G cloud host, deployed in a cluster of hundreds of machines, or run in Kubernetes. Time-series databases, alert engines, and other components can also be decentralized to various data centers and regions, balancing edge deployment with centralized management. **It solves the problem of data fragmentation and lack of unified views**. + + +#### If you are using Prometheus and have one or more of the following requirement scenarios, it is recommended that you upgrade to Nightingale: + +- Multiple systems such as Prometheus, Alertmanager, Grafana, etc. are fragmented and lack a unified view and cannot be used out of the box; +- Managing Prometheus and Alertmanager by editing configuration files has a steep learning curve and makes collaboration difficult; +- You have too much data to handle by scaling up your Prometheus cluster; +- Multiple Prometheus clusters run in production environments, with high management and usage costs; + +#### If you are using Zabbix and have the following scenarios, it is recommended that you upgrade to Nightingale: + +- Monitoring too much data and wanting a more scalable solution; +- A steep learning curve and a desire for better efficiency of collaborative use in a multi-person, multi-team model; +- Microservice and cloud-native architectures with variable monitoring data lifecycles and high-cardinality monitoring data dimensions, which are not easily adaptable to the Zabbix data model; + + +#### If you are using [open-falcon](https://github.com/open-falcon/falcon-plus), we recommend that you upgrade to Nightingale: +- For more information about open-falcon and Nightingale, please refer to [Ten features and trends of cloud-native monitoring](https://mp.weixin.qq.com/s?__biz=MzkzNjI5OTM5Nw==&mid=2247483738&idx=1&sn=e8bdbb974a2cd003c1abcc2b5405dd18&chksm=c2a19fb0f5d616a63185cd79277a79a6b80118ef2185890d0683d2bb20451bd9303c78d083c5#rd). + +## Getting 
Started + +[https://n9e.github.io/](https://n9e.github.io/) + +## Screenshots + +https://user-images.githubusercontent.com/792850/216888712-2565fcea-9df5-47bd-a49e-d60af9bd76e8.mp4 + +## Architecture + + + +Nightingale monitoring can receive monitoring data reported by various collectors (such as [Categraf](https://github.com/flashcatcloud/categraf) , telegraf, grafana-agent, Prometheus, etc.) and write them to various popular time-series databases (such as Prometheus, M3DB, VictoriaMetrics, Thanos, TDEngine, etc.). It provides configuration capabilities for alert rules, silence rules, and subscription rules, as well as the ability to view monitoring data. It also provides automatic alarm self-healing mechanisms (such as automatically calling back to a webhook address or executing a script after an alarm is triggered), and the ability to store and manage historical alarm events and view them in groups. + +If the performance of a standalone time-series database (such as Prometheus) has bottlenecks or poor disaster recovery, we recommend using [VictoriaMetrics](https://github.com/VictoriaMetrics/VictoriaMetrics). The VictoriaMetrics architecture is relatively simple, has excellent performance, and is easy to deploy and maintain. The architecture diagram is as shown above. For more detailed documentation on VictoriaMetrics, please refer to its [official website](https://victoriametrics.com/). 
+ +**We welcome you to participate in the Nightingale open-source project and community in various ways, including but not limited to**: +- Adding and improving documentation => [n9e.github.io](https://n9e.github.io/) +- Sharing your best practices and experience in using Nightingale monitoring => [Article sharing](https://n9e.github.io/docs/prologue/share/) +- Submitting product suggestions => [github issue](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Ffeature&template=enhancement.md) +- Submitting code to make Nightingale monitoring faster, more stable, and easier to use => [github pull request](https://github.com/didi/nightingale/pulls) + + +**Respecting, recognizing, and recording the work of every contributor** is the first guiding principle of the Nightingale open-source community. We advocate effective questioning, which not only respects the developer's time but also contributes to the accumulation of knowledge in the entire community. +- Before asking a question, please first refer to the [FAQ](https://www.gitlink.org.cn/ccfos/nightingale/wiki/faq) +- We use [GitHub Discussions](https://github.com/ccfos/nightingale/discussions) as the communication forum. You can search and ask questions here. +- We also recommend that you join our [Slack channel](https://n9e-talk.slack.com/) to exchange experiences with other Nightingale users. + + +## Who is using Nightingale +You can register your usage and share your experience by posting on **[Who is Using Nightingale](https://github.com/ccfos/nightingale/issues/897)**. + +## Stargazers over time +[![Stargazers over time](https://starchart.cc/ccfos/nightingale.svg)](https://starchart.cc/ccfos/nightingale) + +## Contributors + + + + +## License +[Apache License V2.0](https://github.com/didi/nightingale/blob/main/LICENSE) \ No newline at end of file diff --git a/README_zh.md b/README_zh.md deleted file mode 100644 index 3e23e0596..000000000 --- a/README_zh.md +++ /dev/null @@ -1,74 +0,0 @@ -

- - nightingale - cloud native monitoring -

- -

- - Docs - - Docker pulls - - GitHub contributors -GitHub Repo stars -
GitHub Repo issues -GitHub Repo issues closed -GitHub forks -GitHub latest release -License - - GitHub contributors -

- -

- An alert management expert and an integrated open-source observability platform -

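- -[English](./README.md) | [中文](./README_zh.md) - -Nightingale (夜莺) is an open-source, cloud-native observability tool hosted by the China Computer Federation (CCF). It was first incubated and open-sourced by DiDi in 2020 and was formally donated to the CCF in 2022. Nightingale follows an all-in-one design philosophy, bringing together data collection, visualization, monitoring and alerting, and data analysis; it integrates closely with the cloud-native ecosystem, incorporates observability best practices from leading Internet companies and the accumulated experience of many community experts, and works out of the box. - -## Resources - -- Docs: [flashcat.cloud/docs](https://flashcat.cloud/docs/) -- Questions: [answer.flashcat.cloud](https://answer.flashcat.cloud/) -- Bug reports: [github.com/ccfos/nightingale/issues](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Fbug&projects=&template=bug_report.yml) - - -## Features - -- Unified access to time-series databases: Connects to Prometheus, VictoriaMetrics, Thanos, Mimir, M3DB, and other time-series databases for unified alert management -- Professional alerting: Built-in support for multiple alert rule types, extensible to any notification channel, with alert muting, alert inhibition, alert self-healing, and alert event management -- High-performance visualization engine: Supports many chart styles, ships with numerous built-in dashboard templates, can also import Grafana templates, works out of the box, and uses a business-friendly open-source license -- Seamless pairing with [Flashduty](https://flashcat.cloud/product/flashcat-duty/): Provides alert aggregation and noise reduction, acknowledgment, escalation, on-call scheduling, and IM integration, so that no alert goes unhandled, interruptions are reduced, and teams collaborate better -- Support for all common collectors: Supports [Categraf](https://flashcat.cloud/product/categraf), telegraf, grafana-agent, datadog-agent, and various exporters as collectors, so there is no data that cannot be monitored -- Integrated observability platform: Starting with v6, supports ElasticSearch and Jaeger data sources for unified observability across logs, traces, and metrics - - -## Product Demo - -![Demo](doc/img/n9e-screenshot-gif-v6.gif) - -## Deployment Architecture - -![Architecture](doc/img/n9e-arch-latest.png) - -## Join the Community Group - -You are welcome to join the QQ group (group number: 479290895). The QQ group is best suited to members helping one another; Nightingale developers are usually not in the group. To report a bug, go [here](https://github.com/ccfos/nightingale/issues/new?assignees=&labels=kind%2Fbug&projects=&template=bug_report.yml); to ask a question, go [here](https://answer.flashcat.cloud/). - -## Stargazers over time - -[![Stargazers over time](https://api.star-history.com/svg?repos=ccfos/nightingale&type=Date)](https://star-history.com/#ccfos/nightingale&Date) - - -## Contributors - - - - -## Community Governance -[Nightingale open-source project and community governance structure (draft)](./doc/community-governance.md) - -## License -[Apache License V2.0](https://github.com/didi/nightingale/blob/main/LICENSE)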