-
Notifications
You must be signed in to change notification settings - Fork 588
HDDS-8325. Consolidate and refine RocksDB metrics of services #4506
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
The findbug issue is irrelevant. File TestOzoneBlockTokenIdentifier is not updated in this patch. |
|
Hi @symious, could you help to review this patch at your convenient time? |
|
cc @tanvipenumudy can you please take a look as well? |
|
LGTM. I see the name of the metric is "Hadoop:service=StorageContainerManager,name=Rocksdb_scm.db", does that mean the metrics only have SCM db's metrics? |
Yes, "Hadoop:service=StorageContainerManager,name=Rocksdb_scm.db" will have all exposed rocksdb metrics for SCM DB. |
tanvipenumudy
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The changes look good to me @ChenSammi, thanks.
I don't understand why this findbugs issue is not flagged on other commits and PRs. Fixing it in #4517. |
|
Thanks @symious and @tanvipenumudy for the code review. |
* master: (155 commits) update readme (apache#4535) HDDS-8374. Disable flaky unit test: TestContainerStateCounts HDDS-8016. updated the ozone doc for linked bucket and deletion async limitation (apache#4526) HDDS-8237. [Snapshot] loadDb() used by SstFiltering service creates extraneous directories. (apache#4446) HDDS-8035. Intermittent timeout in TestOzoneManagerHAWithData.testOMHAMetrics (apache#4362) HDDS-8039. Allow container inspector to run from ozone debug. (apache#4337) HDDS-8304. [Snapshot] Reduce flakiness in testSkipTrackingWithZeroSnapshot (apache#4487) HDDS-7974. [Snapshot] KeyDeletingService to be aware of Ozone snapshots (apache#4486) HDDS-8368. ReplicationManager: Create ContainerReplicaOp with correct target Datanode (apache#4532) HDDS-8358. Fix the space usage comparator in ContainerBalancerSelectionCriteria (apache#4527) HDDS-8359. ReplicationManager: Fix getContainerReplicationHealth() so that it builds ContainerCheckRequest correctly (apache#4528) HDDS-8361. Useless object in TestOzoneBlockTokenIdentifier (apache#4517) HDDS-8325. Consolidate and refine RocksDB metrics of services (apache#4506) HDDS-8135. Incorrect synchronization during certificate renewal in DefaultCertificateClient. (apache#4381) HDDS-8127. Exclude deleted containers from Recon container count (apache#4440) HDDS-8364. ReadReplicas may give wrong results with topology-aware read enabled (apache#4522) HDDS-8354. Avoid WARNING about ObjectEndpoint#get (apache#4515) HDDS-8324. DN data cache gets removed randomly asking for data from disk (apache#4499) HDDS-8291. Upgrade to Hadoop 3.3.5 (apache#4484) HDDS-8355. Mark TestOMRatisSnapshots#testInstallSnapshot as flaky ...
https://issues.apache.org/jira/browse/HDDS-8325
How was this patch tested?
manual test the patch by checking the /jmx and /prom output of each service.
Before the patch, for every rocksdb instance, there will be two metrics group in jmx, for example, SCM will have these two metrics group,
After the patch, SCM will have one metrics group for rocksdb,
Before patch,
After patch
