Full text search story

msonawane · April 3, 2019, 4:01am

How good is full-text search in Dgraph? I'm trying to see whether it can replace Solr in the following hypothetical scenario.

News database:
MySQL for storage and Solr for search, so the data is duplicated. It holds about 40 million news articles, and two million more are added every day.
Each article has tags: location, type (crime, entertainment, etc.), and keywords (buried in the description, which is where we use Solr for search).

How capable is Dgraph's full-text search? How many languages can it tokenize?
Would it be better to skip full-text search and instead extract keywords from the text field and create edges for them? For example, location and technology (Java, Golang, etc.) could become edges (has_java, has_go, etc.). Would that be faster to search? For instance, I would like to find all news articles from Hong Kong (location_hongkong) that also have a has_go edge. Would that be faster than building a full-text index?
How would you advise modeling this schema?

martinmr · April 3, 2019, 9:15pm

More on full-text search, including the list of languages supported: https://docs.dgraph.io/query-language/#full-text-search

We use blevesearch/bleve (on GitHub), a modern text indexing library for Go, to create the full-text indexes, so we should be able to support whatever new features and languages are added to that library.

If you mostly intend to search by looking at tags and keywords, you’ll get much better performance by creating edges for them. Full-text indexes are expensive to create and take up a lot of space.

With regard to the schema, here's one possibility:

text: string .
keywords: string @index(term) .
location: geo @index(geo) .
type: string @index(term) .

Of course, I am not sure what your exact use case is, but storing the article metadata as edges will be much more efficient than indexing a raw string and using search to look through it.
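
As a rough illustration of how the schema above might be queried, here is a minimal sketch that looks up articles by keyword, type, and location using only the term and geo indexes. The block name `articles`, the search values ("golang", "crime"), the coordinates (approximately central Hong Kong), and the 50 km radius are all placeholder assumptions, not values from the thread.

```
{
  # Start from the term index on keywords, then narrow by type and location.
  articles(func: allofterms(keywords, "golang"), first: 10)
      @filter(anyofterms(type, "crime") AND near(location, [114.1694, 22.3193], 50000)) {
    uid
    text
    keywords
    type
  }
}
```

Note that `near` takes coordinates as [longitude, latitude] and a distance in meters. Searching inside the article body itself would instead need a fulltext index on `text` (that is, `text: string @index(fulltext) .`) and a function such as `alloftext` or `anyoftext`, which is the more expensive option described above.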
