Become a data scientist quickly.
Go hands-on with the latest neural network, artificial intelligence, and data science techniques employers are seeking!
Created By : ZP
226 Graduates Part Time 6 weeks 2.5 h/session
Become a data scientist quickly.
- Exploration of Data
- DataFrames and Series
- Data Aggregation for Reporting
- Data Preparation: Cleaning, Transforming, Data Fusion
- Introduction to Unsupervised Learning (clustering and data compression)
- Introduction to Supervised Learning (linear regression, and briefly, logistic regression, decision trees, & support vector classifiers)
- numpy Arrays
- Linear Regression: Talking about the Learning in Machine Learning
- Feature Generation
- Underfitting and Overfitting: Qualitatively and Quantitatively
- Model Selection in Linear Regression
- Another Regressor: The Decision Tree Regressor
- Mixing Methods: Clustering and Predictors
- Classification Performance: Confusion Matrix
- Classification Performance: The ROC Curve
- Classification Performance: Linking Classifier Performance to Business Outcomes
- Making Sense of Logistic Regression
- Visualizing Decision Trees
- Extracting Information from Decision Trees
- Visualizing Classification Boundaries
- Naive Bayes for Classification
- Item-Item Similarity-based Collaborative Filtering
- Exploring a Ratings Data Set and Similarity as a Basis for Recommendations
- Beginning to work on our Item-Item Similarity-based Collaborative Filtering Method
- Broadcasting in numpy
- Iterating on the Item-Item Similarity-based Collaborative Filtering Method
- Writing a performant Item/User-Based Collaborative Filtering method
- The Surprise package and other methods for collaborative filtering
Students need to have a basic understanding of the Python language.
If you do not have any mastery over Python, you are encouraged to join our Python Development course.
The course is a highly practical introduction to data-science using Python as a language of choice for its wide variety of libraries and tools designed for the subject.
Students will learn to use widely-used libraries: pandas for data preparation and reporting, numpy for efficient array computations and scikit-learn for unsupervised learning (clustering and data compression) and supervised learning (linear regression, logistic regression, decision trees, and support vector classifiers). Students will also be introduced to the process of designing their own machine learning method through recommendation systems. The provided course materials also will serve as a useful reference (of practical examples) that supports post-course revision and practice. Students who gain competence in the materials (a function of effort) will be ready to enhance their work with elements of data science.
Students are required to have basic knowledge of programming before signing up for this course.
If you want to pick up basic programming skills, please register for our Python Development course instead.
Students will learn how to perform the various parts of the "data science process":
- Preparing data: Cleaning, transforming, etc.
- Selecting a machine learning model (algorithm & settings)
- Training a model
- Evaluating a model
- Making predictions
The course is a data-centric introduction to Python for beginners with no coding background. Python will be introduced through different areas of application.
Data Science Introduction (Python)
The course bridges the gap between Programming and Data Science, focusing on the libraries for introducing Data Science, Web Development, and basic statistics knowledge.