3 Data & Analyses

3.1 Organization

You will create, acquire, and accumulate an enormous amount of files during grad school and much, much more throughout your career. So it is truly imperative to establish a good file naming system and a good folder system for storing everything. The best systems will not only make sense to you, but could also be easily navigable to others. This document is an excellent and simple guide for naming files: Data Science of Marine Conservation

Seriously, include the date in your files and use the International Organization for Standardization (ISO) format.

Figure 3.1: Seriously, include the date in your files and use the International Organization for Standardization (ISO) format.

3.2 Shared folders

We will use a file/folder sharing system in the FEL. Currently, that system is Box. This will be used for group projects as well as one-on-one sharing with Stallings.

For the one-on-one parent folder, there will four subfolders:

  • Funding: this will have subfolders for every scholarships, fellowship, etc that you apply for. The subfolders will house the funding instructions, application, CV, etc.
  • General: this is a catch-all subfolder for items that don’t quite fit in with the other subfolders.
  • Presentations: this will have subfolders for each presentation you give at meetings and seminars.
  • Thesis: this folder will house all your thesis-based data and products. There will be subfolders for each chapter, the proposal (including the proposal presentation here instead of the presentations folder), data, code for analysis, and the defense presentation (here instead of the presentations folder)

3.3 Analytical tools

I strongly encourage you to use R. Regardless of the programming language you choose, having good data management and workflow is critical.