Fix the IOStats computation #1710

Thejas-bhat · 2022-07-15T03:24:51Z

The newly introduced stats:
num_bytes_read_at_query_time - computes the bytes read from disk while query
num_bytes_indexed_after_analysis track the bytes read - computes the bytes written to disk while indexing

are mainly used to track the disk utilisation for the current index. Currently, the num_bytes_indexed_after_analysis (now changing to num_bytes_written_at_index_time) considered only the total bytes of the tokens after the analysis of a field's content. However, a user can further store a field, enable doc values to be stored, enable location information of a term to be stored or even include the field's content in _all field. All these options incur additional cost in terms of disk utilisation and have to be considered in the stats. The PRs #119 and #125 and the current one aim to achieve these changes.

abhinavdangeti · 2022-07-15T18:50:28Z

index/scorch/scorch.go

 		result.VisitFields(func(f index.Field) {
-			atomic.AddUint64(&s.stats.TotBytesIndexedAfterAnalysis,
-				analysisBytes(f.AnalyzedTokenFrequencies()))
+			if segment.CollectIOStats {


I would recommend against adding this flag in scorch_segment_api.
Instead make it a scorch config option and apply this only while -

Estimating usage within bleve

Reporting usage reported by zap again from bleve

Leave zap's computation on always.

abhinavdangeti · 2022-07-25T17:57:30Z

Unit test TestBytesWritten is flaky in the results it expects -

[11:56:02] AD: ~/Documents/go/src/github.com/blevesearch/bleve $ go test -run=TestBytesWritten -race
PASS
ok  	github.com/blevesearch/bleve/v2	2.217s
[11:56:18] AD: ~/Documents/go/src/github.com/blevesearch/bleve $ go test -run=TestBytesWritten
--- FAIL: TestBytesWritten (0.49s)
    index_test.go:314: expected bytes written is 55471, got 92127
FAIL
exit status 1
FAIL	github.com/blevesearch/bleve/v2	0.762s
[11:56:24] AD: ~/Documents/go/src/github.com/blevesearch/bleve $ go test -run=TestBytesWritten
--- FAIL: TestBytesWritten (0.83s)
    index_test.go:338: expected bytes written is 62347, got 117818
FAIL
exit status 1
FAIL	github.com/blevesearch/bleve/v2	1.068s
[11:56:27] AD: ~/Documents/go/src/github.com/blevesearch/bleve $ go test -run=TestBytesWritten
--- FAIL: TestBytesWritten (1.08s)
    index_test.go:363: expected bytes written is 102602, got 164949
FAIL
exit status 1
FAIL	github.com/blevesearch/bleve/v2	1.322s
[11:56:31] AD: ~/Documents/go/src/github.com/blevesearch/bleve $ go test -run=TestBytesWritten
PASS
ok  	github.com/blevesearch/bleve/v2	1.326s

Thejas-bhat · 2022-08-05T05:39:29Z

Turns out,TestBytesWritten behaviour was flaky mainly because of a memory leak issue in zapx which has been addressed in zapx PR #125

sreekanth-cb · 2022-08-05T05:38:04Z

index_test.go

+
+	contentFieldMapping.IncludeInAll = true
+	tmpIndexPath2 := createTmpIndexPath(t)
+


clean up of this path is missing?
Also, should we need to defer it as we could try it right after the index close? (doesn't matter much)
It could also be written like a table-driven test since the variations across run are only a few values.

abhinavdangeti · 2022-08-08T15:23:27Z

index_test.go

+	idx.Close()
+
+	return statValError
+}


Insert a new line at the end of a method.

abhinavdangeti · 2022-08-08T15:23:34Z

index_test.go

+		t.Fatal(err)
+	}
+	cleanupTmpIndexPath(t, tmpIndexPath4)
+}


Thejas-bhat requested review from abhinavdangeti and sreekanth-cb July 15, 2022 03:24

abhinavdangeti requested changes Jul 15, 2022

View reviewed changes

Thejas-bhat force-pushed the statsUnderFlag branch 3 times, most recently from d359c60 to ff04e75 Compare July 20, 2022 14:30

abhinavdangeti added this to the v2.3.4 milestone Jul 20, 2022

abhinavdangeti changed the title ~~Making the IOStats computation optional~~ Fix the IOStats computation Jul 22, 2022

abhinavdangeti force-pushed the statsUnderFlag branch from ca57327 to 1f157d2 Compare July 22, 2022 17:31

abhinavdangeti marked this pull request as ready for review July 22, 2022 17:32

Improving the accounting of bytes written to disk stats

f8f966f

Thejas-bhat force-pushed the statsUnderFlag branch from 0b7fd9d to f8f966f Compare August 4, 2022 12:57

Thejas-bhat requested a review from metonymic-smokey August 5, 2022 05:40

sreekanth-cb reviewed Aug 5, 2022

View reviewed changes

Thejas-bhat force-pushed the statsUnderFlag branch from 5555143 to 2acbf6a Compare August 8, 2022 13:35

abhinavdangeti reviewed Aug 8, 2022

View reviewed changes

including disk stats computation bug fixes from zapx

351885c

Thejas-bhat force-pushed the statsUnderFlag branch from 2acbf6a to 351885c Compare August 8, 2022 15:28

abhinavdangeti approved these changes Aug 8, 2022

View reviewed changes

sreekanth-cb approved these changes Aug 10, 2022

View reviewed changes

Thejas-bhat merged commit d89c6c0 into blevesearch:master Aug 10, 2022

abhinavdangeti mentioned this pull request Sep 22, 2022

Significant query throughput drop observed with v2.3.4 #1731

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix the IOStats computation #1710

Fix the IOStats computation #1710

Uh oh!

Thejas-bhat commented Jul 15, 2022 •

edited

Loading

Uh oh!

abhinavdangeti Jul 15, 2022

Uh oh!

abhinavdangeti commented Jul 25, 2022

Uh oh!

Thejas-bhat commented Aug 5, 2022

Uh oh!

sreekanth-cb Aug 5, 2022

Uh oh!

Thejas-bhat Aug 8, 2022

Uh oh!

abhinavdangeti Aug 8, 2022

Uh oh!

abhinavdangeti Aug 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		contentFieldMapping.IncludeInAll = true
		tmpIndexPath2 := createTmpIndexPath(t)

Fix the IOStats computation #1710

Fix the IOStats computation #1710

Uh oh!

Conversation

Thejas-bhat commented Jul 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

abhinavdangeti Jul 15, 2022

Choose a reason for hiding this comment

Uh oh!

abhinavdangeti commented Jul 25, 2022

Uh oh!

Thejas-bhat commented Aug 5, 2022

Uh oh!

sreekanth-cb Aug 5, 2022

Choose a reason for hiding this comment

Uh oh!

Thejas-bhat Aug 8, 2022

Choose a reason for hiding this comment

Uh oh!

abhinavdangeti Aug 8, 2022

Choose a reason for hiding this comment

Uh oh!

abhinavdangeti Aug 8, 2022

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Thejas-bhat commented Jul 15, 2022 •

edited

Loading