2.1 Data Storage
You will have a separate GitHub repository that is dedicated to the final project. This will facilitate some of the peer evaluation tasks you will be asked to complete over the course of the semester, allowing you to share a repository with a colleague without exposing all of your other work and grades in the process.
All final project materials should be stored in this repository, and it should be organized following the project organization principles discussed in Sociospatial Data Science. This means that it should have subfolders for data/
, docs/
, results/
, and source/
as well as a .Rproj
file. If subfolders are empty, they will not be tracked by GitHub. If you are using only one computer, the folder will remain visible locally and Git will start tracking it once you begin adding materials. If you are using multiple computers, you can create empty text files named .gitkeep
using RStudio (File > New File > Text File
) and keep one in each subfolder.
Since you will be storing this repository on GitHub, you will be default be using version control for this project. You should make commits (like voting in Chicago!) early and often. Make sure that your commit messages are informative and clear. Remember, if you have to go back to an earlier version of your work, you will want to make it as easy as possible to do so.
Also make sure that you are pushing to GitHub often and not just saving the work locally on your computer. This is part of your insurance against a catastrophic loss of your computer or a lab computer where you are working. You are strongly encouraged to use a more comprehesive backup solution, which you can lean more about in Sociospatial Data Science.