Submission

How to submit: 

  • Prediction results should be uploaded to the Box links sent to the Team Leaders. If you haven’t received a link, please let us know at takahiroyabe@nyu.edu 
    • You do not need to send submission emails or anything else; just upload your predictions!

Format of submission: 

  • The submitted files should be named {teamname}_{cityB,cityC,cityD}_humob24.csv.gz. For example, team ‘dreamteam’ submitting its solution for city B should name its prediction file dreamteam_cityB_humob24.csv.gz [updated July 31]
  • The submission should contain the same 5 columns as the original dataset (user ID, day, timeslot, x, y). Separate the columns with commas (,), include no redundant spaces, and save the file in the csv.gz format (a gzip-compressed CSV).
  • Only send the data for the predicted users. In other words, for each of cities B, C, and D, submit data for only the 3000 predicted users.
  • Please use the Validator toolkit (below) to make sure your submission format is correct!
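
As a minimal sketch of the format rules above, the following Python snippet writes prediction rows to a gzip-compressed CSV named after the stated pattern. The function name write_submission, its arguments, and the choice to omit the optional header line are illustrative assumptions, not part of the official tooling.

```python
import csv
import gzip
from pathlib import Path


def write_submission(rows, team, city, out_dir="."):
    """Write (uid, d, t, x, y) rows to {team}_city{city}_humob24.csv.gz.

    Hypothetical helper for illustration; not part of the official toolkit.
    """
    if city not in ("B", "C", "D"):
        raise ValueError("city must be 'B', 'C', or 'D'")
    path = Path(out_dir) / f"{team}_city{city}_humob24.csv.gz"
    # newline="" lets the csv module control line endings itself.
    with gzip.open(path, "wt", newline="", encoding="utf-8") as f:
        writer = csv.writer(f, lineterminator="\n")
        for uid, d, t, x, y in rows:
            # Cast to int so values are written without spaces or decimals.
            writer.writerow([int(uid), int(d), int(t), int(x), int(y)])
    return path
```

Calling write_submission(rows, "dreamteam", "B") would produce dreamteam_cityB_humob24.csv.gz in the current directory. The result should still be checked with the validator before uploading.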

Validator toolkit: 

  • To avoid mistakes in the submission format, we implemented a validator for submission files, available at https://github.com/yahoojapan/geobleu.
  • You can check whether your submission files match the requirements with a standalone Python program, validator.py. It takes the task ID and the submission file path as arguments and emits errors if it finds anything wrong with the number of columns, the uid values, or the value ranges of d, t, x, and y. A submission file may begin with the header line “uid,d,t,x,y”, but omitting it is also acceptable.
    • For example, assuming your submission file for task 1 before compression is at foo/dreamteam_cityb_humob.csv, the command will be: python3 validator.py 1 foo/dreamteam_cityb_humob.csv
  • Line numbers in error messages are 0-indexed. If the tool finds no problems, it prints “Validation finished without errors!”. Please only submit files that have passed this validation step!
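
Before running validator.py, a quick local sanity check can catch the most obvious formatting problems. The sketch below only checks the column count, integer formatting, and the number of distinct user IDs in a compressed file. The function name quick_check and its expected_users parameter are hypothetical, and the check does not replace the official validator, which also verifies value ranges.

```python
import csv
import gzip


def quick_check(path, expected_users=3000):
    """Return a list of problems found in a .csv.gz submission file.

    Hypothetical helper for illustration; it does not replace validator.py.
    """
    problems = []
    users = set()
    with gzip.open(path, "rt", newline="", encoding="utf-8") as f:
        # Line numbers are 0-indexed, like the validator's messages.
        for line_no, row in enumerate(csv.reader(f)):
            if line_no == 0 and row == ["uid", "d", "t", "x", "y"]:
                continue  # the header line is optional
            if len(row) != 5:
                problems.append(f"line {line_no}: expected 5 columns, got {len(row)}")
                continue
            # Plain ASCII digits only; this also rejects padded values like " 5".
            if not all(v.isascii() and v.isdigit() for v in row):
                problems.append(f"line {line_no}: non-integer or padded value")
                continue
            users.add(row[0])
    if len(users) != expected_users:
        problems.append(f"expected {expected_users} users, found {len(users)}")
    return problems
```

An empty list returned by quick_check means only that these basic checks passed; the file should still go through validator.py.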

Other rules: 

  • Only 1 submission per team will be evaluated. The last submission uploaded before the deadline (September 20th, 23:59 AoE) will be considered the final submission.
  • Cite the data descriptor document in your submissions and future work where the HuMob data is used:
    • Yabe, T., Tsubouchi, K., Shimizu, T., Sekimoto, Y., Sezaki, K., Moro, E., & Pentland, A. (2024). YJMob100K: City-scale and longitudinal dataset of anonymized human mobility trajectories. Scientific Data, 11(1), 397. https://www.nature.com/articles/s41597-024-03237-9

Multi-City Human Mobility Prediction