-
Version/image: skywalking-oap-server:8.9.1
-
Hi, I'm using the latest version of Apache SkyWalking.
-
Phenomena
The selected first getAddress is xxx.xxx.xx.xx:port. The remove stage is skipped.
table: xxx does not exist. OAP is running in 'no-init' mode, waiting... retry 3s later.
Root cause
This bug has existed for many years, dating back to Oct. 2018. The TTL timer expects queryRemoteNodes to always return the OAP instance list in a stable order, so that exactly one OAP node is selected to take responsibility for removing expired indices and creating the latest (today's) indices when rolling. But, typically and as has been proven, the k8s coordinator does not return an ordered instance list. As a result, it is possible that no OAP node is selected, and the TTL timer then does not really work at all.
In this case, most indices could be created normally as new telemetry data would trigger index creation automatically. But
Fix
The pull request to fix this is #9632.
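To illustrate the root cause described above, here is a minimal sketch of the "first node in the list does the TTL work" selection. It is not taken from the SkyWalking code base or from the pull request: the class name, method names, and addresses are all hypothetical, and sorting the list is just one assumed way to make the selection deterministic.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Hypothetical model of the selection logic, not SkyWalking's actual classes.
final class TtlSelectionSketch {

    // A node runs the TTL work only if it is the first entry in its own view of the cluster.
    static boolean isSelected(List<String> view, String self) {
        return !view.isEmpty() && view.get(0).equals(self);
    }

    // Same check, but on a sorted copy so that every node agrees on who is first.
    static boolean isSelectedSorted(List<String> view, String self) {
        List<String> sorted = new ArrayList<>(view);
        Collections.sort(sorted);
        return isSelected(sorted, self);
    }

    // Counts how many nodes consider themselves responsible for the TTL work.
    static int countSelected(Map<String, List<String>> views, boolean sortFirst) {
        int selected = 0;
        for (Map.Entry<String, List<String>> e : views.entrySet()) {
            boolean isFirst = sortFirst
                ? isSelectedSorted(e.getValue(), e.getKey())
                : isSelected(e.getValue(), e.getKey());
            if (isFirst) {
                selected++;
            }
        }
        return selected;
    }

    // Assumed example: each node receives the same two instances, in a different order.
    static Map<String, List<String>> unorderedViews() {
        return Map.of(
            "10.0.0.1:11800", List.of("10.0.0.2:11800", "10.0.0.1:11800"),
            "10.0.0.2:11800", List.of("10.0.0.1:11800", "10.0.0.2:11800"));
    }

    public static void main(String[] args) {
        Map<String, List<String>> views = unorderedViews();
        System.out.println("selected without sorting: " + countSelected(views, false)); // 0
        System.out.println("selected with sorting: " + countSelected(views, true));     // 1
    }
}
```

In the unsorted case each node sees the other node first, so both skip the remove stage, which matches the "The remove stage is skipped" log on every instance.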