Skip to content

Commit

Permalink
add some index
Browse files Browse the repository at this point in the history
  • Loading branch information
yindaheng98 committed Feb 27, 2024
1 parent 2708d7e commit 46cb6f1
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -173,6 +173,16 @@ e.g. write to `neo4j://localhost:7687`:
python -m dblp_crawler -k video -k edge -p l/JiangchuanLiu neo4j --uri neo4j://localhost:7687
```

#### Tips

Without index, NEO4J query will be very very slow. So before you start, you should add some index:

```cql
CREATE INDEX publication_title_hash_index FOR (p:Publication) ON (p.title_hash);
CREATE INDEX publication_dblp_key_index FOR (p:Publication) ON (p.dblp_key);
CREATE INDEX publication_doi_index FOR (p:Publication) ON (p.doi);
```

### Only crawl the paper after specified year

e.g. crawl the paper after 2016 (include 2016)
Expand Down

0 comments on commit 46cb6f1

Please sign in to comment.