The fundamentals of data analysis in R
Having a statistical approach to solving problems has perhaps never been as necessary as it is today in business. From automatic image recognition on Facebook to unemployment figures, from opinion polls to the detection of gravitational waves, data and its interpretation have become part of our modern societies. Considered in recent years as the most complete and powerful software for advanced statistical data analysis, R now has a large community of users. Faithful to the Open Source philosophy, they have gradually created and made available reusable extensions that allow, for example, easy connection to databases or that offer analysis models specifically adapted to particular fields (engineering, quantitative finance, sociology, economics, marketing, …). This training on R will enable participants to create their statistical analyses and learn how to reproduce them in graphical form.
The objective of this training is to provide the basis for a critical understanding of the data and its interpretation. To do this, we will take advantage of two approaches:
- We will devote a good part of the training to the study of basic statistical concepts: correlation, causality, inference, hypothesis testing, etc. These notions will be approached not through the prism of mathematical formalism, but through an intuitive understanding of the techniques employed in R. We will combine theory with practice, working on a small survey that will be produced from A to Z by the course participants.
- We will also bring the insights produced by the social and management sciences on what is called quantification: can we perceive "reality" through statistical categories?
The details
The main elements of this course include an introduction to the fundamentals of statistics:
- General introduction
- Statistics as a tool for objectification: finding a problem
- From scientific questioning to inquiry: proposing questions
- Development of the questionnaire
- Population survey: Begin distribution of questionnaires
- Statistical vocabulary
- Introduction to data analysis with R
- Explore a database
- Univariate descriptive statistics
- Bivariate descriptive statistics
- Statistical Inference with R
The clients
This training is intended for all analysts and professionals seeking to acquire the necessary skills that will help them conduct or understand data mining and segmentation in your business.
- Level : Beginner
- Duration : Two days