I’m back from being sick!
Anyways, I was starting work on a PR for this issue when I noticed we could probably do better with the functions.
Specifically, my proposal would be to switch this gigantic struct into a smaller struct
// Function holds the information about gql functions.
type Function struct {
Attr string
Lang string // language of the attribute value
Args []Arg // Contains the arguments of the function.
UID []uint64
NeedsVar []VarContext // If the function requires some variable
IsValueVar bool // eq(val(s), 5)
IsLenVar bool // eq(len(s), 5)
Fn
}
// NEW TYPE HERE THAT ENUMERATES THE FUNCTIONS SUPPORTED BY DQL
type Fn uint16 // Fn is probably not a great name.
const (
count Fn = iota
eq
...
)
The primary benefit is that all functions that DQL supports are enumerated once and exactly once, leading to an easy definition of things later on.
Also there appears to be some small benefits to the struct changes. The new struct is 16 bytes smaller as shown here.
While not much of an impact, this will also have impacts on github.com/dgraph-io/dgraph/query.Function.
Again, having defined and enumerating the functions in the gql
package, we can free up a lot of additional resources in the downstream. Instead of having a massive switch statement worker/task.go, you can just replace that with a map[parser.Fn]FuncType
that is created at init
, populated with the correct functions. Less work done.
This also makes the String functions and the datetime functions issues very much easier to address. Again, simply update the enumerations, and the maps in the correct places, and done!
Now I know we are not adding anything to GQL any time soon, but this tiny refactor buys a lot of flexibility down the road. I don’t mind taking it up on a part time basis either. I’ve quite a bit of experience implementing programming languages so I know what sort of pitfalls to look out for.
But, of course I could be very wrong and missing something important, because I’m new to the codebase.
Pros:
- Buys us flexibility down the road
- Potential performance gains due to the computer doing much less work in the first place
Cons:
- This proposal is written by a n00b who may not know all the intricacies of the system yet
- Really large footprint for a tiny refactor. Pretty much the entire query lang chain is touched.