GitHunt
SP

Spantree/instacart-neo4j

Playing with Instacart data in Neo4j

400 trash bags of grocery receipts + Neo4j

Help loading data and sample queries from the meetup
talk
.

Check out the slides
here
.

Running neo4j with the Instacart data

Prerequisites

To run this locally, you'll need

  • a local docker environment
  • the instacart data (in tar.gz format) downloaded and moved to the root
    of this project directory

You can download the data
here, and
must agree to the terms and
conditions
.
For more information about the contents of the files, see this data
dictionary
.

If you don't want to use docker, you can also run neo4j in other ways,
but for the ease of setup, I've chosen to use docker. The biggest thing
you'll have to do manually if you don't use docker is move the raw data
to the equivalent of /var/lib/neo4j/import for the method you're
running neo4j.

Prepping the data

There's a script, prep-data.sh, that will untar the data and then
scrub a CSV that has improperly escaped doublequotes. Once you've run
that script, you should be able to run the docker container and start
executing the import queries.

All in all that should look like:

./prep-data.sh
docker-compose up

Now check out the cypher directory of this project for queries to
import the data and all the queries I use in the presentation.

Languages

Shell100.0%

Contributors

Created July 20, 2017
Updated October 8, 2021
Spantree/instacart-neo4j | GitHunt