Invalid memory address or nil pointer dereference error

BlankRain · September 17, 2020, 8:32am

invalid memory address or nil pointer dereference
SIGSEGV: segmentation violation code=0x1 addr=0x0 pc=0x14fb4be

upsert{
  query{
    v as var(func: eq(id,12)) # this one has 1 000 000 followers
  }
  mutation {
   delete{
    uid(v) <followers>  * .
   }
  }
}

followers has count and reverse index.

code at this line

github.com

dgraph-io/dgraph/blob/master/posting/index.go#L280


	}

	return nil
}

func (l *List) handleDeleteAll(ctx context.Context, edge *pb.DirectedEdge, txn *Txn) error {
	isReversed := schema.State().IsReversed(ctx, edge.Attr)
	isIndexed := schema.State().IsIndexed(ctx, edge.Attr)
	hasCount := schema.State().HasCount(ctx, edge.Attr)
	delEdge := &pb.DirectedEdge{
		Attr:   edge.Attr,
		Op:     edge.Op,
		Entity: edge.Entity,
	}
	// To calculate length of posting list. Used for deletion of count index.
	plen := l.Length(txn.StartTs, 0)
	err := l.IterateAll(txn.StartTs, 0, func(p *pb.Posting) error {
		switch {
		case isReversed:
			// Delete reverse edge for each posting.
			delEdge.ValueId = p.Uid

chewxy · September 18, 2020, 10:45am

@ibrahim ideas?

@BlankRain - wondering if you could share the dataset?

ibrahim · September 18, 2020, 11:35am

@BlankRain can you please show the stack trace? and the version on which you saw this panic.

BlankRain · September 22, 2020, 7:52am

Dgraph version   : v20.07.0-12-g681fe9116
Dgraph codename  : shuri-mod
Go version       : go1.14.1

@chewxy we use twitter data:

wget http://an.kaist.ac.kr/\~haewoon/release/twitter_social_graph/twitter_rv.tar.gz 

# split by line
split -l 54535107 twitter_rv.net

then use the go code to convert csv data to rdf

package main
import (
    "path/filepath"
    "os"
    "fmt"
	"flag"
	"bufio"
	"io"
	"strings"
)

func getFilelist(path string) []string {
	files := []string{}
    err := filepath.Walk(path, func(path string, f os.FileInfo, err error) error {
        if f == nil {
			return err
		}
        if f.IsDir() {
			return nil
		}
		files = append(files,path)
        return nil
    })
    if err != nil {
        fmt.Printf("filepath.Walk() returned %v\n", err)
	}
	return files
}

func rdf(f string, separator string, ch chan string) {
	rv, err := os.Open(f)
	
	if err != nil {
		fmt.Println("open file err=", err)
	    rv.Close()
		return
	}

	defer rv.Close()

	out, _ := os.OpenFile(f+".rdf", os.O_RDWR|os.O_CREATE|os.O_APPEND, 0644)
	
	reader := bufio.NewReader(rv)
	wirter := bufio.NewWriter(out)

	nodes := make(map[string]bool)
	//循环的读取文件的内容
	errcount := 0
	for {
		str, err := reader.ReadString('\n') // 读到一个换行就结束
		if err == io.EOF {                  // io.EOF表示文件的末尾
			break
		}
		//输出内容
		line := strings.Split(str, separator)
		if len(line) != 2 {
			errcount++
			fmt.Printf("from now on find %d errors in %s\n", errcount, f)
			continue
		}
		src := strings.Trim(line[0], "\n")
		dst := strings.Trim(line[1], "\n")
		tpl := "_:v%s <id> \"%s\" . \n_:v%s <dgraph.type> \"twitter_user\" .\n"
		if !nodes[src] {
			lout := fmt.Sprintf(tpl, src, src, src)
			wirter.Write([]byte(lout))
			nodes[src] = true
		}
		if !nodes[dst] {
			lout := fmt.Sprintf(tpl, dst, dst, dst)
			wirter.Write([]byte(lout))
			nodes[dst] = true
		}

		lout := fmt.Sprintf("_:v%s <followers> _:v%s . \n", src, dst)
		wirter.Write([]byte(lout))
	}
	wirter.Flush()
	out.Close()
	ch <- f + ".rdf is ok!"
}

func main(){
    flag.Parse()
	separatorKey := flag.Arg(0)
	root := flag.Arg(1)

	separatorMap := map[string]string{"s": " ", "t": "\t", "c": ","}
	separator := separatorMap[separatorKey]

	if 0 == len(separator) {
		fmt.Printf("Please specify the separator！\n t for Tab \n s for Space \n c for comma \n such as: ./gocsv2rdf s ./test \n")
		return
	}

	if 0 == len(root) {
		fmt.Printf("Please specify the CSV folder！\n such as: ./gocsv2rdf s ./test \n")
		return
	}

	files :=getFilelist(root)
	fmt.Printf("%v\n", files)

	
	count := len(files)
	if 0 == count {
		fmt.Printf("Please specify the correct CSV folder！\n")
		return
	}
	ch := make(chan string, count)
	for i:= 0;i<count;i++{
      	name := files[i]
		fmt.Printf("Runninng for %s\n", name)
		//rdf(name, separator, ch)
		go rdf(name, separator, ch) //内存足够大就用这行代码,并行处理
	}
	i := 0
	for x := range ch {
		fmt.Println(x)
		i++
		if i == int(count) {
			break
		}
	}
	fmt.Println("Over!")
}

then bulk load or live load into dgraph.

The schema is

id: int @index(int) .
followers: [uid]  .

type twitter_user {
    id
    followers
}

ibrahim · September 22, 2020, 8:21am

Hey @BlankRain, can you show me the complete stack trace of the crash? I’m trying to figure out what sequence of function calls led to this crash.

BlankRain · September 22, 2020, 8:36am

Hi ,
This is the stack trace .

ibrahim · September 23, 2020, 6:17am

@BlankRain It looks like you’re running a modified version of dgraph

Dgraph version   : v20.07.0-12-g681fe9116
Dgraph codename  : shuri-mod
Go version       : go1.14.1

Can you try running on a released version of dgraph? I don’t have the code you’re running and I cannot debug it .

BlankRain · September 23, 2020, 7:17am

Hi
The code based on the release of v20.07.0 .

I modified the code to fix some bugs .
for example:
Bulk loader crashes during reduce phase - #26 by BlankRain
The git log ,I based is this one.

commit 7431be0dce0ffc42e3ff3e31b39fcdd5505e2bb3
Author: Martin Martinez Rivera <martinmr@dgraph.io>
Date:   Mon Aug 3 11:54:25 2020 -0700

    add cluster lables to the jaeger containers (#5951) (#6009)

ibrahim · September 23, 2020, 7:37am

Thanks @BlankRain. So the call to posting() is failing because uidPosting is nil on line 273.

github.com

dgraph-io/dgraph/blob/7431be0dce0ffc42e3ff3e31b39fcdd5505e2bb3/posting/list.go#L260-L275


func (it *pIterator) posting() *pb.Posting {
	uid := it.uids[it.uidx]

	for it.pidx < it.plen {
		p := it.plist.Postings[it.pidx]
		if p.Uid > uid {
			break
		}
		if p.Uid == uid {
			return p
		}
		it.pidx++
	}
	it.uidPosting.Uid = uid
	return it.uidPosting
}

But it looks like we initialize the it.uidPosting in init which is called by iterate

github.com

dgraph-io/dgraph/blob/7431be0dce0ffc42e3ff3e31b39fcdd5505e2bb3/posting/list.go#L142


	}

	it.afterUid = afterUid
	it.deleteBelowTs = deleteBelowTs
	if deleteBelowTs > 0 {
		// We don't need to iterate over the immutable layer if this is > 0. Returning here would
		// mean it.uids is empty and valid() would return false.
		return nil
	}

	it.uidPosting = &pb.Posting{}
	it.dec = &codec.Decoder{Pack: it.plist.Pack}
	it.uids = it.dec.Seek(it.afterUid, codec.SeekCurrent)
	it.uidx = 0

	it.plen = len(it.plist.Postings)
	it.pidx = sort.Search(it.plen, func(idx int) bool {
		p := it.plist.Postings[idx]
		return it.afterUid < p.Uid
	})
	return nil

@BlankRain would you be able to share your dataset? We can try to reproduce it but I doubt if we can easily reproduce it on our end. If you can share your dataset, we can look at it to figure out what’s causing this.

@animesh2049 Do you see any reason pb.Posting can be nil here? The cluster is running in ludicrous mode.

BlankRain · September 23, 2020, 8:00am

I can share my dataset. It may be too huge to copy.
My dataset is based on the twitter data.
I can show you how to produce the data.

wget http://an.kaist.ac.kr/\~haewoon/release/twitter_social_graph/twitter_rv.tar.gz 

tar xvf twitter_rv.tar.gz 

split -l 54535107 twitter_rv.net

mkdir test

mv x* test

rm test/*.rdf
./gocsv2rdf t test

# then copy test/*.rdf out for bulk load and start the dgraph cluster

the source code of gocsv2rdf is here

package main
import (
    "path/filepath"
    "os"
    "fmt"
	"flag"
	"bufio"
	"io"
	"strings"
)

func getFilelist(path string) []string {
	files := []string{}
    err := filepath.Walk(path, func(path string, f os.FileInfo, err error) error {
        if f == nil {
			return err
		}
        if f.IsDir() {
			return nil
		}
		files = append(files,path)
        return nil
    })
    if err != nil {
        fmt.Printf("filepath.Walk() returned %v\n", err)
	}
	return files
}

func rdf(f string, separator string, ch chan string) {
	rv, err := os.Open(f)
	
	if err != nil {
		fmt.Println("open file err=", err)
	    rv.Close()
		return
	}

	defer rv.Close()

	out, _ := os.OpenFile(f+".rdf", os.O_RDWR|os.O_CREATE|os.O_APPEND, 0644)
	
	reader := bufio.NewReader(rv)
	wirter := bufio.NewWriter(out)

	nodes := make(map[string]bool)
	//循环的读取文件的内容
	errcount := 0
	for {
		str, err := reader.ReadString('\n') // 读到一个换行就结束
		if err == io.EOF {                  // io.EOF表示文件的末尾
			break
		}
		//输出内容
		line := strings.Split(str, separator)
		if len(line) != 2 {
			errcount++
			fmt.Printf("from now on find %d errors in %s\n", errcount, f)
			continue
		}
		src := strings.Trim(line[0], "\n")
		dst := strings.Trim(line[1], "\n")
		tpl := "_:v%s <id> \"%s\" . \n_:v%s <dgraph.type> \"twitter_user\" .\n"
		if !nodes[src] {
			lout := fmt.Sprintf(tpl, src, src, src)
			wirter.Write([]byte(lout))
			nodes[src] = true
		}
		if !nodes[dst] {
			lout := fmt.Sprintf(tpl, dst, dst, dst)
			wirter.Write([]byte(lout))
			nodes[dst] = true
		}

		lout := fmt.Sprintf("_:v%s <followers> _:v%s . \n", src, dst)
		wirter.Write([]byte(lout))
	}
	wirter.Flush()
	out.Close()
	ch <- f + ".rdf is ok!"
}

func main(){
    flag.Parse()
	separatorKey := flag.Arg(0)
	root := flag.Arg(1)

	separatorMap := map[string]string{"s": " ", "t": "\t", "c": ","}
	separator := separatorMap[separatorKey]

	if 0 == len(separator) {
		fmt.Printf("Please specify the separator！\n t for Tab \n s for Space \n c for comma \n such as: ./gocsv2rdf s ./test \n")
		return
	}

	if 0 == len(root) {
		fmt.Printf("Please specify the CSV folder！\n such as: ./gocsv2rdf s ./test \n")
		return
	}

	files :=getFilelist(root)
	fmt.Printf("%v\n", files)

	
	count := len(files)
	if 0 == count {
		fmt.Printf("Please specify the correct CSV folder！\n")
		return
	}
	ch := make(chan string, count)
	for i:= 0;i<count;i++{
      	name := files[i]
		fmt.Printf("Runninng for %s\n", name)
		//rdf(name, separator, ch)
		go rdf(name, separator, ch) //内存足够大就用这行代码,并行处理
	}
	i := 0
	for x := range ch {
		fmt.Println(x)
		i++
		if i == int(count) {
			break
		}
	}
	fmt.Println("Over!")
}

go build it then run it.
all this may need at least 30min or more.

BlankRain · September 23, 2020, 8:05am

upsert{
  query{
    v as var(func: eq(id,12)) # this one has 1 000 000 followers
  }
  mutation {
   delete{
    uid(v) <followers>  * .
   }
  }
}

I do some delete in upsert block.
the reason pb.Posting can be nil may around here.

ibrahim · September 23, 2020, 8:24am

Thanks @BlankRain. I’ve accepted this as a bug. @animesh2049 will run some tests locally on your dataset and get back to you.

animesh2049 · September 24, 2020, 12:04pm

Hey @BlankRain I have pushed a change, commit id 7873f04087e671b5e45d87aaa1875680324a0f7b. Can you please apply this commit and see if you are still getting the error.

BlankRain · September 25, 2020, 1:46am

hi, which branch ?

chewxy · September 25, 2020, 2:33am

you can just git checkout 7873f040

BlankRain · September 25, 2020, 3:20am

ok,thanks ,let me try

BlankRain · September 25, 2020, 6:11am

just post the related pr here: Initialize posting list in moveToNextPart by animesh2049 · Pull Request #6560 · dgraph-io/dgraph · GitHub

animesh2049 · September 28, 2020, 7:58am

Hey @BlankRain are you still facing the issue ?

BlankRain · September 28, 2020, 8:02am

Sorry , we didn’t finish the test yet. Our cluster is broken. I will let you know when we finish the test.

chewxy · September 30, 2020, 9:57pm

2 posts were split to a new topic: Tokenizer panics (type error)

Topic		Replies	Views
Panic while migrating Dgraph	0	488	February 27, 2022
Simple mutation - A panic is trapped GraphQL status:accepted , ticket:created	5	536	May 13, 2021
Bugs in delete preds when the postinglist type is BitDeltaPosting (0x4) Dgraph status:accepted , kind:bug , ticket:created	10	718	January 19, 2021
Alpha node get restarted with: invalid memory address or nil pointer dereference Dgraph dgraph , status:accepted , kind:bug , priority:p0 , area:crash	7	799	December 2, 2019
Error in dgraphassigner when using prebuilt binaries Users	3	862	October 1, 2016

Invalid memory address or nil pointer dereference error

Related topics