edited README and env.sample

This commit is contained in:
Ben Varick 2025-11-07 08:43:48 -08:00
parent f85935dd75
commit 575311edc6
Signed by: ben
SSH key fingerprint: SHA256:jWnpFDAcacYM5aPFpYRqlsamlDyKNpSj3jj+k4ojtUo
2 changed files with 13 additions and 7 deletions

View file

@ -6,13 +6,12 @@ Dagster setup that scrapes GTFS and GTFS-RT for specified transit agencies and a
This reads from the `config/agency_list.csv` file, edit this file to include the transit agencies that you want to scrape, add the relevant IDs from mobilitydatabase.org This reads from the `config/agency_list.csv` file, edit this file to include the transit agencies that you want to scrape, add the relevant IDs from mobilitydatabase.org
## set your environment ## set your environment
### Data directory
edit dagster.yaml to specify the correct data directory under run_launcher.
Right now it says: `/home/ben/code/gtfs-dagster/data:/opt/dagster/app/data`, change the first part to where you want the data to be stored.
### .env file ### .env file
copy `env.sample` to `.env` and change the database password and the mobility database refresh token copy `env.sample` to `.env` and change:
- Postgres database password - make it something random before the first run
- MobilityDatabase.org API token
- Location of data, config, and postgres_data directories (default is in working directory)
# Run it # Run it
`docker compose build` `docker compose build`

View file

@ -1,6 +1,13 @@
# Postgres credentials
POSTGRES_USER=dagster POSTGRES_USER=dagster
POSTGRES_PASSWORD=change-me POSTGRES_PASSWORD=change-me-to-something-random
POSTGRES_DB=dagster POSTGRES_DB=dagster
MOBILITY_DB_REFRESH_TOKEN=put-token-from-mobilitydatabase-here # Storage locations
DATA_DIRECTORY=${PWD}/data
POSTGRES_DIRECTORY=${PWD}/postgres_data
CONFIG_DIRECTORY=${PWD}/config
# MobilityDatabase.org API token
MOBILITY_DB_REFRESH_TOKEN=get-from-MobilityDatabase.org