Classroom/ Online: Yes/ Yes
Data mining with Python
The growth in the volume, speed and complexity of data over the last decade has made it increasingly challenging to make sense of the world. In more and more cases, data has outgrown the tried and tested tools that professionals have come to rely on. In today’s data-dense world, new tools are needed. Python is one such tool; it has emerged as one of the more popular and sought-after skills for professionals looking to build a career in Data Analytics or Data Science.
Many practitioners start their analytics journeys with tools such as MS Excel. But as they progress in their careers, their data sets grow in size and complexity, to the point where they exceed the capabilities of the analytics toolbox. Python is a practical and natural extension of that toolbox: it addresses many of Excel’s limitations while remaining relevant to the times. However, because it is a whole new platform and programming language, many struggle to make the transition to Python.
About the workshop
This 2-day workshop aims to provide a broad but practical overview of the analytics value chain and the role that Python programming plays in it. It is targeted at delegates with limited or no programming background who are looking to start their learning journey with Python. Delegates will get the opportunity to get their hands “dirty” with sizable real-world data sets and experience what it takes to import and prepare data for analytics, as well as mine the processed data for pivotal insights using Python. By the end of the workshop, delegates will have a clear understanding of programming concepts and the building blocks of the Python language, and will be able to write Python scripts from scratch, leverage libraries and conduct analysis with data visualisation in Python.
Objective
- Appreciate what analytics is and the role that Python programming plays in it
- Appreciate the basic elements in computer programming – syntax, data types, data structures, decision structures, iterations (for loops), and functions
- Appreciate why and when Python programming is practical and relevant in analytics
- Learn how to build Python code from scratch and how to leverage public libraries to enhance your Python code for data analytics
- Learn how to conduct analysis and solve real-world problems using Python
Outline
Day 1
1. What is Analytics and what it takes to build sustainable capabilities
2. The Analytics value chain and the role of Data Cleaning and Processing
3. Introduction to Python Programming
4. Building your Python Vocabulary
Day 2
1. Introduction to Pandas (Python Data Analysis Library)
2. Visualizing data with Python
3. Capstone Data Project
1. What is Analytics and what it takes to build sustainable capabilities
- This segment seeks to define what analytics is and isn’t, why it matters in today’s world and what it takes to build sustainable value
2. The Analytics value chain and the role of Data Cleaning and Processing
- An introduction to the Analytics Value Chain and the role that Python programming plays in it
- Analytics in Action – An actual case study of Python programming creating value for organizations
3. Introduction to Python Programming
- Defining what Python programming is and isn’t
- Understand the key benefits and limitations of Python programming and why it matters in analytics
- Appreciate some of the key applications of Python in today’s world
4. Building your Python Vocabulary
- Understanding data types, data structures, decision structures, and iterations in Python
- Appreciate and build Python functions from scratch
- Introduction to Python libraries for data analysis
- Hands-on, follow-along exercises to build Python programmes from scratch
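As a rough illustration of the building blocks listed above (data types, data structures, decision structures, iterations and functions), here is a minimal sketch. The sample values and the names `daily_sales`, `store`, `classify_sale` and `summarise` are hypothetical and are not taken from the workshop materials.

```python
# Hypothetical sample data (illustrative values only, not workshop material)
daily_sales = [120.0, 85.5, 0.0, 240.25, 99.75]   # data structure: list of floats
store = {"name": "Demo Store", "open_days": 5}     # data structure: dictionary


def classify_sale(amount):
    """Label a sale amount using a simple decision structure."""
    if amount == 0:
        return "none"
    elif amount < 100:
        return "low"
    else:
        return "high"


def summarise(sales):
    """Loop over the sales and count how many fall under each label."""
    counts = {}
    for amount in sales:                            # iteration: for loop
        label = classify_sale(amount)
        counts[label] = counts.get(label, 0) + 1
    return counts


summary = summarise(daily_sales)
total = sum(daily_sales)
print(store["name"], summary, total)
```

The two small functions keep the decision logic separate from the loop, which makes each piece easier to test and reuse.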
Day 2
1. Introduction to Pandas (Python Data Analysis Library)
- Understanding the data structures in Pandas (Series and DataFrame)
- Pandas Series (creating, accessing data, changing data, checking for null values, boolean selectors)
- Pandas DataFrame (creating, changing the index of the rows and columns, adding a new Series to a DataFrame)
- Learn how Pandas can be used to carry out many analytics operations, such as data cleaning, data processing and statistical analysis
- Hands-on, follow-along exercises to use Pandas for data cleaning and analysis
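The Series and DataFrame operations listed above might look roughly like the following sketch. It assumes the `pandas` package is installed (it ships with Anaconda); the item names, quantities and prices are made up for illustration.

```python
import pandas as pd

# Create a Series; None is stored as a missing value (NaN)
prices = pd.Series([10.0, None, 30.0], index=["a", "b", "c"])

first_price = prices["a"]          # access data by label
null_mask = prices.isnull()        # check for null values
prices["b"] = 20.0                 # change data (fills the missing value)
expensive = prices[prices > 15]    # boolean selector

# Create a DataFrame from a dictionary of columns (hypothetical data)
df = pd.DataFrame({"item": ["pen", "book", "bag"], "qty": [3, 1, 2]})
df.index = ["a", "b", "c"]                     # change the row index
df = df.rename(columns={"qty": "quantity"})    # change a column label
df["price"] = prices                           # add a Series as a new column (aligned on index)
df["total"] = df["quantity"] * df["price"]     # derive a new column

print(df)
print(df["total"].describe())                  # basic summary statistics
```

Note that adding a Series to a DataFrame aligns values on the index labels, not on position, which is why the row index is set before the `price` column is added.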
2. Visualizing data with Python
- Data visualization is an increasingly vital and popular tool in analytics, both for analysis and for communication
- Introduction to Matplotlib for data visualization with Python
- Hands-on, follow-along exercises to draw line charts, bar charts, scatter plots, histograms and box plots using Matplotlib to generate insights
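The five chart types named above can be drawn with Matplotlib along the lines of the sketch below. It assumes `matplotlib` is installed (it ships with Anaconda); the monthly figures and the output file name `charts.png` are hypothetical.

```python
import matplotlib.pyplot as plt

# Hypothetical monthly figures (illustrative values only)
months = ["Jan", "Feb", "Mar", "Apr"]
sales = [120, 135, 90, 160]
visitors = [300, 340, 250, 410]

# One figure with a 2 x 3 grid of panels
fig, axes = plt.subplots(2, 3, figsize=(12, 6))

axes[0, 0].plot(months, sales, marker="o")   # line chart
axes[0, 0].set_title("Line chart")

axes[0, 1].bar(months, sales)                # bar chart
axes[0, 1].set_title("Bar chart")

axes[0, 2].scatter(visitors, sales)          # scatter plot
axes[0, 2].set_title("Scatter plot")

axes[1, 0].hist(sales, bins=3)               # histogram
axes[1, 0].set_title("Histogram")

axes[1, 1].boxplot(sales)                    # box plot
axes[1, 1].set_title("Box plot")

axes[1, 2].axis("off")                       # hide the unused sixth panel

fig.tight_layout()
fig.savefig("charts.png")                    # in an interactive session, plt.show() also works
```

Placing all panels in one figure makes it easy to compare how each chart type presents the same small data set.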
3. Capstone Data Project
- Delegates will download a large data set and apply their newfound skills to clean and process data using Pandas
- Delegates will then conduct analysis of the data set using Pandas and Matplotlib to identify insights as well as answer questions
- Delegates will then share their findings and approach with each other to facilitate learning and retention
Who should attend
- Professionals with zero knowledge of programming (prior MS Excel experience would be helpful)
- Professionals who are aspiring to a career in Data Science or Analytics looking to broaden their toolbox and skill sets
- Professionals with data science or analytics responsibilities who have reached or are reaching the limits of their current toolbox and are considering Python programming as the next step on their learning journey
- All other professionals with a keen interest in building a strong foundation in Python programming
Methodology
This is a 2-day hands-on course.
Logistics Requirements
Participants are required to bring their own laptops with Anaconda (Python) installed. Installation guides will be provided and installation validated before class.
Workshops are designed to be delivered face to face.
Testimonials
“The trainer was patient in troubleshooting troubles faced by the participants, and it was useful being able to practice what was being taught.”
“A lot of practice given to us and this able to let us understand better.”
“FYTs data analytics workshop was practical and engaging. The trainer shared useful techniques, tips and shared relatable real-life examples. Right after the course, I was able to create an interactive dashboard to share with the team.”
Profile of Admond Lee
BSc, Data Scientist, Senior Consultant
- A scientist by training, Admond is a consulting data scientist and trainer whose mission is to make data science accessible to everyone. Admond helps companies leverage the power of data analytics to drive business value.
- Prior to joining FYT, he was a data scientist at Micron, where he built an end-to-end data science solution for smart manufacturing processes, saving up to $4M in manufacturing costs.
- Admond has consulted with multiple companies on data science projects and has regular speaking and training engagements for clients and at conferences.
- A passionate data science writer, Admond has had his work and experience featured by various publications, including KDnuggets, Medium, Tech in Asia, AI Time Journal and a local business magazine.
- Admond holds a Bachelor’s Degree in Physics and Applied Physics from the Nanyang Technological University (NTU). He is fluent in English, Mandarin, Cantonese and Malay.