taxi

Winning entry to the Kaggle taxi competition
git clone https://esimon.eu/repos/taxi.git
Log | Files | Refs | README

commit 87fc87384e6d9b7d88ca622a17dac7b8bc15cacb
parent c97af300b17ac042c52cfc54f43d4f01fd61fbe9
Author: Étienne Simon <esimon@esimon.eu>
Date:   Tue, 14 Jul 2015 07:58:20 -0400

Add note about PYTHONPATH in README

Diffstat:
MREADME.md | 2++
1 file changed, 2 insertions(+), 0 deletions(-)

diff --git a/README.md b/README.md @@ -40,6 +40,8 @@ Here is a brief description of the Python files in the archive: ## How to reproduce the winning results? There is an helper script `prepare.sh` which might helps you (by performing step 1-6 and some other checks), but if you encounter an error, the script will re-execute all the steps from the beginning (before the actual training, step 2, 4 and 5 are quite long). + +Note that some script expect the repository to be in your PYTHONPATH (go to the root of the repository and type `export PYTHONPATH="$PWD:$PYTHONPATH"`). 1. Set the `TAXI_PATH` environment variable to the path of the folder containing the CSV files. 2. Run `data/csv_to_hdf5.py "$TAXI_PATH" "$TAXI_PATH/data.hdf5"` to generate the HDF5 file (which is generated in `TAXI_PATH`, along the CSV files). This takes around 20 minutes on our machines.