Too Long; Didn't Read
Working on a data science project is almost always equivalent to an amazing clutter in the working directory. Data scientists would most likely have the following materials dumped in their project working directory: Python/R scripts, data sets, journal articles, references, notebooks, scripts, notebooks and other references. The directory heirarchy is organized as repo/src/python/main/R, repo/source/py/lib (for utilities), repo/s/lib/main (for scala codes) An ansible-playbooks are created to automated repeatitive tasks.
Share Your Thoughts