The art and science of making sense of data is a highly sought-after skill in today’s data-driven world.

 

Data science isn’t just for data scientists. In a massively connected, data-driven world, it is imperative that the workforce of today and tomorrow be able to understand what data is available and to use scientific methods to analyze and interpret it.

Data science is now recognized as a critical growth area with impact across many sectors, including science, government, finance, health care, telecom, manufacturing, advertising, and retail.

Launch your data science career with this practical workshop. Build a solid foundation in machine learning using R and start exploring data-related careers.


 

Top 9 Reasons to Attend

 
  • Understand the art and science of discovering patterns and making intelligent predictions from big data.

  • Define machine learning, explain why it matters, and discuss its relationship to analytics, data science, and big data.

  • Learn machine learning fundamentals, the importance of algorithms, and machine learning as a service.

  • Learn the basics of the R platform, programming language concepts, common and useful R commands, and how to apply machine learning methods.

  • Do machine learning: understand the steps in the machine learning pipeline, from data acquisition and feature generation to training and model selection.

  • Learn through practice the most commonly used machine learning methods, covering both supervised and unsupervised learning.

  • Develop an understanding of which algorithm to choose based on the analytics challenge and the data you have.

  • Appreciate the trade-offs involved in choosing particular techniques for particular problems.

  • Discover how to understand, interpret, and convey the results of the data science life cycle.



Agenda

The workshop has a strong focus on gaining hands-on experience implementing algorithms and building predictive models on real datasets. By the end of the three days, participants will be ready to apply machine learning algorithms to their own data and start generating value immediately.

The workshop will take participants through the conceptual and applied foundations of the subject. Topics covered include:

   R for Statistical Analysis and Machine Learning

   Machine learning theory, types of learning

   Techniques, models and methods

 

The labs are designed to give hands-on practice using the R programming language and its packages to apply the main concepts and techniques of data science and machine learning.

Lab 1 - R IDE Setup and Basic Commands

Lab 2 - Data Frames, Functions, Using Packages and Basic Plotting

Lab 3 - Feature Selection

Often we have far too many features to work with. In this lab, we see how to use pairwise statistical tests to select high-information features and discard low-information features.
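
As a rough illustration of the idea (not the lab material itself), the sketch below tests each feature against the target with a pairwise correlation test and keeps the significant ones. The synthetic data, the feature names x1 to x4, the choice of `cor.test()`, and the 0.01 threshold are all assumptions made for this example.

```r
# Illustrative only: synthetic data with hypothetical feature names.
set.seed(42)
n <- 200
signal <- rnorm(n)
features <- data.frame(
  x1 = signal + rnorm(n, sd = 0.5),  # informative
  x2 = rnorm(n),                     # pure noise
  x3 = -signal + rnorm(n),           # informative
  x4 = rnorm(n)                      # pure noise
)
target <- 2 * signal + rnorm(n)

# Pairwise test: Pearson correlation of each feature with the target.
p_values <- sapply(features, function(col) cor.test(col, target)$p.value)

# Keep features whose test is significant at an assumed 0.01 threshold.
selected <- names(p_values)[p_values < 0.01]

print(sort(p_values))
print(selected)
```

With a categorical target, a different pairwise test (for example a t-test or chi-squared test) would take the place of the correlation test.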

Lab 4 - Feature Transformation

Principal component analysis creates a new set of features as linear combinations of the original features. These are ordered by the amount of variance each captures, and selecting a subset of high-variance principal components provides a powerful way both to reduce the number of features used and to ensure that those used have high information content.
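
A minimal sketch of this in base R, assuming the built-in iris measurements as stand-in data and an arbitrary 95% cumulative-variance cutoff (neither is taken from the lab):

```r
# Illustrative only: iris is a stand-in dataset, 0.95 is an assumed cutoff.
measurements <- iris[, 1:4]

# Centre and scale so every original feature contributes on the same footing.
pca <- prcomp(measurements, center = TRUE, scale. = TRUE)

# Share of total variance captured by each component, in decreasing order.
var_explained <- pca$sdev^2 / sum(pca$sdev^2)
cum_var <- cumsum(var_explained)

# Smallest number of components reaching the assumed cutoff.
k <- which(cum_var >= 0.95)[1]

# The reduced feature set: scores on the first k principal components.
reduced <- pca$x[, 1:k, drop = FALSE]

print(round(var_explained, 3))
print(dim(reduced))
```

The loadings in `pca$rotation` show how each component is built as a linear combination of the original features.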

Lab 5 - Linear Regression

Use ordinary least squares to model how (X1) the education requirements of a career and (X2) its remuneration relate to (Y) the prestige in which the vocation is held. This provides experience with an important foundational algorithm.
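
To give a feel for what such a model looks like in R, here is a sketch that fits ordinary least squares with `lm()`. The data are synthetic and the column names education, income, and prestige are hypothetical stand-ins, not the lab's dataset.

```r
# Illustrative only: synthetic stand-in data with hypothetical column names.
set.seed(1)
n <- 100
education <- runif(n, 6, 16)                    # years of education
income <- rlnorm(n, meanlog = 9, sdlog = 0.5)   # remuneration
prestige <- 5 + 3 * education + 0.001 * income + rnorm(n, sd = 5)
careers <- data.frame(education, income, prestige)

# Ordinary least squares: prestige (Y) on education (X1) and income (X2).
fit <- lm(prestige ~ education + income, data = careers)

print(coef(fit))               # estimated intercept and slopes
print(summary(fit)$r.squared)  # share of variance explained
```

Calling `summary(fit)` prints standard errors and p-values alongside the coefficients.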

Lab 6 - Poisson Regression

Use Poisson regression to model the relationship between (X1) wind and (Y) ozone levels. This looks at one of the typical cases where linear regression is unsuitable and provides an introduction to the use of generalized linear models in R.
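
A sketch of a Poisson GLM of this kind, assuming R's built-in airquality dataset (which has Ozone and Wind columns) as the data; whether the lab uses this exact dataset is an assumption.

```r
# Illustrative only: assumes the built-in airquality dataset.
aq <- na.omit(airquality[, c("Ozone", "Wind")])

# Poisson regression with the default log link: log(E[Ozone]) = b0 + b1 * Wind.
fit <- glm(Ozone ~ Wind, family = poisson(link = "log"), data = aq)

print(coef(fit))

# exp(b1) is the multiplicative change in expected ozone per unit of wind speed.
print(exp(coef(fit)["Wind"]))

# Predicted ozone on the original scale for a few assumed wind speeds.
new_wind <- data.frame(Wind = c(5, 10, 15))
pred <- predict(fit, newdata = new_wind, type = "response")
print(pred)
```

Because of the log link, predictions are always positive, which a plain linear model cannot guarantee.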

Lab 7 - Polynomial Regression

Use polynomial regression to model clearly non-linear synthetic data. This provides a clear example of basis transformation.
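
The sketch below shows basis transformation on synthetic data: the model stays linear in its coefficients, but the input x is expanded into x, x^2, and x^3. The cubic generating function and the noise level are assumptions chosen for the example.

```r
# Illustrative only: synthetic data from an assumed cubic relationship.
set.seed(7)
x <- seq(-3, 3, length.out = 120)
y <- 1 - 2 * x + 0.5 * x^3 + rnorm(length(x), sd = 1)

# A straight line cannot follow the curvature.
linear_fit <- lm(y ~ x)

# Basis transformation: regress on x, x^2 and x^3 (raw polynomial terms).
cubic_fit <- lm(y ~ poly(x, 3, raw = TRUE))

print(summary(linear_fit)$r.squared)
print(summary(cubic_fit)$r.squared)
print(unname(coef(cubic_fit)))  # intercept, then x, x^2, x^3 coefficients
```

Dropping `raw = TRUE` makes `poly()` use orthogonal polynomials, which gives the same fitted values with better-conditioned coefficients.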

Lab 8 - Logistic Regression

Use logistic regression to model proportional and binary data. In the first case we model the proportion of women who have reached menarche versus their age in years, and in the second, the probability of a seed germinating versus its age in days. This gives us a second look at how to use generalized linear models in R, and provides experience with another important foundational algorithm.
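
Both cases can be sketched with `glm()` and the binomial family. The proportional example assumes the menarche dataset from the MASS package (columns Age, Total, Menarche); the binary seed data are synthetic, with hypothetical variable names and an assumed germination curve.

```r
# Illustrative only: MASS::menarche is an assumed dataset, seed data are synthetic.
library(MASS)

# Proportional data: successes and failures per age group as a two-column response.
menarche_fit <- glm(cbind(Menarche, Total - Menarche) ~ Age,
                    family = binomial, data = menarche)
print(coef(menarche_fit))

# Binary data: one 0/1 outcome per seed.
set.seed(3)
age_days <- runif(300, 0, 100)
germinated <- rbinom(300, size = 1, prob = plogis(3 - 0.06 * age_days))
seed_fit <- glm(germinated ~ age_days, family = binomial)
print(coef(seed_fit))

# Predicted germination probability for seeds of assumed ages.
seed_pred <- predict(seed_fit, newdata = data.frame(age_days = c(10, 50, 90)),
                     type = "response")
print(seed_pred)
```

The only difference between the two fits is the form of the response: counts of successes and failures in the first, individual 0/1 outcomes in the second.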

Lab 9 - Linear Discriminant Analysis and Quadratic Discriminant Analysis

We use LDA and QDA to classify synthetic data. These provide a simple introduction to the use of Gaussian distributions, as well as exposure to these surprisingly well-performing techniques.
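
A sketch of both classifiers using `lda()` and `qda()` from the MASS package on two synthetic Gaussian classes. The class means, covariances, and sample sizes are assumptions for the example, and accuracy is measured on the training data only to keep it short.

```r
# Illustrative only: two synthetic Gaussian classes with assumed parameters.
library(MASS)
set.seed(11)
n <- 150
class_a <- mvrnorm(n, mu = c(0, 0), Sigma = diag(2))
class_b <- mvrnorm(n, mu = c(2.5, 2.5), Sigma = matrix(c(1, 0.6, 0.6, 1), 2))

dat <- data.frame(rbind(class_a, class_b),
                  label = factor(rep(c("a", "b"), each = n)))
names(dat)[1:2] <- c("x1", "x2")

# LDA assumes one shared covariance matrix; QDA fits one per class.
lda_fit <- lda(label ~ x1 + x2, data = dat)
qda_fit <- qda(label ~ x1 + x2, data = dat)

# Training accuracy (optimistic; a held-out set would be used in practice).
lda_acc <- mean(predict(lda_fit, dat)$class == dat$label)
qda_acc <- mean(predict(qda_fit, dat)$class == dat$label)
print(c(lda = lda_acc, qda = qda_acc))
```

LDA produces a linear decision boundary, while QDA's per-class covariance allows a curved one.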

Lab 10 - Random Forests and AdaBoost

Use these tree-based ensemble techniques to model passenger survival in the Titanic disaster. Simple but high-performing, these methods are favorites of data scientists. These exercises provide experience using these important techniques in R.

Lab 11 - Putting it all together

Register

Data Science and Machine Learning Workshop, Doha (18-20 October 2016)

Data Science and Machine Learning Workshop, Dubai (14-16 February 2017)

Data Science and Machine Learning Workshop, Bahrain (13-15 December 2016)

Study Guide: The Essentials of Data Analytics and Machine Learning

As a participant in this workshop, you will receive an exclusive copy of this study guide. The guide provides a deep understanding of the techniques and practices of machine learning and introduces a wide set of resources that data scientists and analysts can use in their work. Readers will find explanations of the theory behind the algorithms and models covered, giving them an understanding of the strengths and weaknesses of each. They can use this understanding to reason about suitable approaches to real-life problems and to communicate that reasoning to other stakeholders.

© 2017 Everati - All Rights Reserved