Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Start adding documentation and fix data races in query.go. #3749

Merged
merged 10 commits into from
Aug 6, 2019

Conversation

pawanrawal
Copy link
Contributor

@pawanrawal pawanrawal commented Aug 5, 2019

  1. Add comments explaining what various fields in SubGraph are used for and what are the functions doing.
  2. This change also fixes the remaining race conditions mentioned in Races while running flock #3685.
  3. Moves preTraverse related code to outputnode.go as it doesn't deal with the query execution and query.go is very long(3k lines).

This change is Reviewable

query/query.go Outdated
Order []*pb.Order // List of predicates to sort by and the sort order.
// TODO - Is Var only populated when this subgraph defines a variable or even when it needs it?
Var string // The name of the variable required by this SubGraph.
NeedsVar []gql.VarContext // The list of variables required by this SubGraph along with their type which can be uid or value.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

line is 128 characters (from lll)

@pawanrawal pawanrawal changed the title Start adding comments and fix data races in query.go. Start adding documentation and fix data races in query.go. Aug 5, 2019
Copy link
Contributor Author

@pawanrawal pawanrawal left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are still a lot more comments that I want to add which I'll do over time.

Reviewable status: 0 of 3 files reviewed, 5 unresolved discussions (waiting on @golangcibot)


query/outputnode.go, line 527 at r2 (raw file):

	return bufw.Bytes(), nil
}

All the code in this field has been moved from query.go without change, hence doesn't need to be looked at carefully.


query/query.go, line 125 at r1 (raw file):

Previously, golangcibot (Bot from GolangCI) wrote…

line is 128 characters (from lll)

Done.


query/query.go, line 1409 at r2 (raw file):

	if len(sg.Params.FacetVar) == 0 || sg.Params.Facet == nil {
		return nil
	}

Reversing the conditions to reduce the indentation of the code.


query/query.go, line 1546 at r2 (raw file):

	for _, v := range sg.Params.NeedsVar {
		l, ok := mp[v.Name]
		if !ok {

This allows us to reduce the nesting of the code below.


worker/task.go, line 48 at r2 (raw file):

var (
	emptyUIDList    pb.List

These global variables were the cause of the race condition since they are shared between different concurrent queries.

@pawanrawal pawanrawal marked this pull request as ready for review August 5, 2019 06:15
@pawanrawal pawanrawal requested review from manishrjain and a team as code owners August 5, 2019 06:15
Copy link
Contributor

@manishrjain manishrjain left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice change!

Can you confirm there's no logical changes, so I don't need to review that deeply. Also, once it's merged, can you run flock (ask Aman how to), keep it running, so you can identify any issues.

Reviewed 1 of 2 files at r1, 1 of 2 files at r2, 1 of 1 files at r3.
Reviewable status: all files reviewed, 5 unresolved discussions (waiting on @golangcibot)

Copy link
Contributor

@manishrjain manishrjain left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:lgtm:

Reviewable status: all files reviewed, 5 unresolved discussions (waiting on @golangcibot)

query/query.go Outdated
Var string
NeedsVar []gql.VarContext
Alias string
Count int // Value of first parameter.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The word first could be confusing. I initially thought there was a sequence of parameters somewhere, and first meant the first of those parameters. Could quote it may be.

query/query.go Outdated
FacetOrder string
FacetOrderDesc bool

// The name of the defined by this SubGraph.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Something is missing, probably meant defined variable

query/query.go Outdated
@@ -111,51 +110,76 @@ type Latency struct {
Json time.Duration `json:"json_conversion"`
}

// params contains the list of parameters required to execute a query.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

execute a Subgraph may be?

@@ -174,14 +198,29 @@ type Function struct {
// query and the response. Once generated, this can then be encoded to other
// client convenient formats, like GraphQL / JSON.
type SubGraph struct {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

query and subgraph are interchangeably used. Not sure whether it makes sense to keep the distinction (or there is one)

query/query.go Outdated
// execute this query.
Params params

// count stores the count of a predicate. There would be one value corresponding to each uid
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What does count of a predicate mean?

query/query.go Outdated
@@ -1104,11 +860,14 @@ func createTaskQuery(sg *SubGraph) (*pb.Query, error) {
return out, nil
}

// varValue is a generic representation of a variable and holds multiple things.
// TODO - Come back to this and document what do individual fields mean and when are they populated.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add TODO(name) everywhere.

query/query.go Outdated
for _, uid := range sg.SrcUIDs.Uids {
curVal, ok := sg.Params.uidToVal[uid]
if ok && types.CompareVals(sg.SrcFunc.Name, curVal, dst) {
sg.DestUIDs.Uids = append(sg.DestUIDs.Uids, uid)
}
}
} else {
// This means its a root as SrcUIDs is nil
// This means its a function at root as SrcUIDs is nil
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's

Copy link
Member

@mangalaman93 mangalaman93 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can't run flock without the desktop. @animesh2049 could you run flock and check for race conditions? Happy to sit with you if needed.

Reviewable status: all files reviewed, 12 unresolved discussions (waiting on @golangcibot and @pawanrawal)

Copy link
Contributor Author

@pawanrawal pawanrawal left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is just one logical change which calls fillVars with sg.Params.ParentVars only if the current SubGraph has any children. That's a minor optimization. Apart from that there are no other logical changes.

I did run flock for 30 mins before my laptop killed the process and couldn't observe the race happening anymore. It happens in 5 mins without the change. Though would be nice for @animesh2049 to verify as well.

Reviewable status: 2 of 3 files reviewed, 12 unresolved discussions (waiting on @golangcibot, @mangalaman93, and @manishrjain)


query/query.go, line 113 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

execute a Subgraph may be?

Done.


query/query.go, line 116 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

The word first could be confusing. I initially thought there was a sequence of parameters somewhere, and first meant the first of those parameters. Could quote it may be.

Done


query/query.go, line 130 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

Something is missing, probably meant defined variable

Done.


query/query.go, line 200 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

query and subgraph are interchangeably used. Not sure whether it makes sense to keep the distinction (or there is one)

Done. That was a pretty old comment.


query/query.go, line 209 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

What does count of a predicate mean?

Is count of an edge better?


query/query.go, line 864 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

Please add TODO(name) everywhere.

Done.


query/query.go, line 1668 at r3 (raw file):

Previously, mangalaman93 (Aman Mangal) wrote…

it's

Done.

@pawanrawal pawanrawal merged commit 8ba4a85 into master Aug 6, 2019
@pawanrawal pawanrawal deleted the query-docs branch August 6, 2019 01:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

4 participants