class: center, middle, inverse, title-slide # Programming Tools in Data Science ## Lecture #1: Introduction ### Samuel Orso ### 27 September 2021 --- # Motivation * "data science" hits ~3% of jobs on jobup.ch <center><iframe width="640" height="480" src="https://www.youtube.com/embed/Tzin1DgexlE"> </iframe></center> --- # General goals * introduce tools and workflows for reproducible research (R/RStudio, Git/GitHub, etc.); * introduce principles of tidy data and tools for data wrangling; * exploit data structures to appropriately manage data, computer memory and computations; * data manipulation through controls, instructions, and tailored functions; * develop new software tools including functions, Shiny applications, and packages; * manage the software development process including version control, documentation (with embedded code), and dissemination for other users. --- # General goals <img src="images/diagram.png" width="593" height="459" style="display: block; margin: auto;" /> --- class: sydney-blue, center, middle # Course logistic and expectation --- # Course logistic and expectation ## Location and time .pull-left[ .scroll-box-5[
]] .pull-right[ .scroll-box-5[ * Anthropole 3741 * Every Monday morning: class 9 to 10, practical 10:15 to 12 * Watch video (~30 min) every week before class * It is possible to follow the class on Zoom * Some classes maybe given on Zoom (to be defined) ]] --- # Course logistic and expectation ## Material * You need a laptop * We work mainly with <img src="images/rlogo.png" width="150px"/> and <img src="images/rstudio.png" width="300px"/> * All software we work with will be free for academic purposes --- # Course logistic and expectation ## Requirements * No IT background is assumed from the students but a strong will to learn useful and practical programming skills ([Data Science in Business Analytics](https://tvatter.github.io/dsfba_2020/)) * Willing to work and collaborate in groups (4~6 people) * Be ready to struggle with your computer! <center><img src="https://media.giphy.com/media/bPCwGUF2sKjyE/giphy.gif" alt="gif"/></center> --- # Course logistic and expectation ## Grading * Learning outcomes will be assessed based on the performances within each of the following categories: Type | Points | Bonus :-- | :-- | :-- Semester project | 30 | 3 Homeworks | 30 | 3 * 4 homeworks in groups of 7.5 points (**penalty for late submission**). * No final examination for this class. * Final presentation of project last day of class (20th Dec). --- # Course logistic and expectation ## Communication * We use <img src="images/slack.png" width="200px"/> to communicate and many more * We use the **NEIN rule**! (No Email, only If Necessary) * More info at [https://ptds.samorso.ch/](https://ptds.samorso.ch/) * To access slack: register at [https://shiny.samorso.ch/fillingform/](https://shiny.samorso.ch/fillingform/) and wait your invitation <img src="images/qrcode_data-analytics-lab.shinyapps.io.png" width="150px"/> --- class: sydney-blue, center, middle # Question ? .pull-down[ <a href="https://ptds.samorso.ch/"> .white[<svg viewBox="0 0 384 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M369.9 97.9L286 14C277 5 264.8-.1 252.1-.1H48C21.5 0 0 21.5 0 48v416c0 26.5 21.5 48 48 48h288c26.5 0 48-21.5 48-48V131.9c0-12.7-5.1-25-14.1-34zM332.1 128H256V51.9l76.1 76.1zM48 464V48h160v104c0 13.3 10.7 24 24 24h104v288H48z"></path></svg> website] </a> <a href="https://github.com/ptds2021/"> .white[<svg viewBox="0 0 496 512" style="height:1em;position:relative;display:inline-block;top:.1em;" xmlns="http://www.w3.org/2000/svg"> <path d="M165.9 397.4c0 2-2.3 3.6-5.2 3.6-3.3.3-5.6-1.3-5.6-3.6 0-2 2.3-3.6 5.2-3.6 3-.3 5.6 1.3 5.6 3.6zm-31.1-4.5c-.7 2 1.3 4.3 4.3 4.9 2.6 1 5.6 0 6.2-2s-1.3-4.3-4.3-5.2c-2.6-.7-5.5.3-6.2 2.3zm44.2-1.7c-2.9.7-4.9 2.6-4.6 4.9.3 2 2.9 3.3 5.9 2.6 2.9-.7 4.9-2.6 4.6-4.6-.3-1.9-3-3.2-5.9-2.9zM244.8 8C106.1 8 0 113.3 0 252c0 110.9 69.8 205.8 169.5 239.2 12.8 2.3 17.3-5.6 17.3-12.1 0-6.2-.3-40.4-.3-61.4 0 0-70 15-84.7-29.8 0 0-11.4-29.1-27.8-36.6 0 0-22.9-15.7 1.6-15.4 0 0 24.9 2 38.6 25.8 21.9 38.6 58.6 27.5 72.9 20.9 2.3-16 8.8-27.1 16-33.7-55.9-6.2-112.3-14.3-112.3-110.5 0-27.5 7.6-41.3 23.6-58.9-2.6-6.5-11.1-33.3 2.6-67.9 20.9-6.5 69 27 69 27 20-5.6 41.5-8.5 62.8-8.5s42.8 2.9 62.8 8.5c0 0 48.1-33.6 69-27 13.7 34.7 5.2 61.4 2.6 67.9 16 17.7 25.8 31.5 25.8 58.9 0 96.5-58.9 104.2-114.8 110.5 9.2 7.9 17 22.9 17 46.4 0 33.7-.3 75.4-.3 83.6 0 6.5 4.6 14.4 17.3 12.1C428.2 457.8 496 362.9 496 252 496 113.3 383.5 8 244.8 8zM97.2 352.9c-1.3 1-1 3.3.7 5.2 1.6 1.6 3.9 2.3 5.2 1 1.3-1 1-3.3-.7-5.2-1.6-1.6-3.9-2.3-5.2-1zm-10.8-8.1c-.7 1.3.3 2.9 2.3 3.9 1.6 1 3.6.7 4.3-.7.7-1.3-.3-2.9-2.3-3.9-2-.6-3.6-.3-4.3.7zm32.4 35.6c-1.6 1.3-1 4.3 1.3 6.2 2.3 2.3 5.2 2.6 6.5 1 1.3-1.3.7-4.3-1.3-6.2-2.2-2.3-5.2-2.6-6.5-1zm-11.4-14.7c-1.6 1-1.6 3.6 0 5.9 1.6 2.3 4.3 3.3 5.6 2.3 1.6-1.3 1.6-3.9 0-6.2-1.4-2.3-4-3.3-5.6-2z"></path></svg> GitHub] </a> ]