- Locations: Jeddah Dubai Abu Dhabi Eastern Province - Saudi Arabia Riyadh
- Duration: Upto 3 Days
Course details
OverviewThis course Provides instruction on the processes and practice of data science, including machine learning and natural language processing. Included are: tools and programming languages (Python, IPython, Mahout, Pig, NumPy, pandas, SciPy, Scikitlearn), the Natural Language Toolkit (NLTK), and Spark MLlib.
Duration
3 days
Course Objectives
• Recognize use cases for data science on Hadoop
• Describe the Hadoop and YARN architecture
• Describe supervised and unsupervised learning differences
• Use Mahout to run a machine learning algorithm on Hadoop
• Describe the data science life cycle
• Use Pig to transform and prepare data on Hadoop
• Write a Python script
• Describe options for running Python code on a Hadoop cluster
• Write a Pig User-Defined Function in Python
• Use Pig streaming on Hadoop with a Python script
• Use machine learning algorithms
• Describe use cases for Natural Language Processing (NLP)
• Use the Natural Language Toolkit (NLTK)
• Describe the components of a Spark application
• Write a Spark application in Python
• Run machine learning algorithms using Spark MLlib
• Take data science into production
Updated on 27 June, 2018
Eligibility / Requirements
Students must have experience with at least one programming or scripting language, knowledge in statistics and/or mathematics, and a basic understanding of big data and Hadoop principles. Students new to Hadoop are encouraged to attend the HDP
About Agilitics Pte. Ltd.
Agilitics Pte. Ltd. is Singapore headquartered, Data and Business Analytics focussed company. We are the real experts of the big data domain.
Established in 2013, Head quartered at Singapore,
Agilitics Pte Ltd is a leading Big Data Analytics and Agile Consulting and Training solutions provider
Our Tagline is Agility + Analytics Delivered.
We offer a comprehensive range of Big data ecosystem and Agile management solution, services and expertise for Information Management, Data Analytics, Machine Learning, Artificial Intelligence and Smart City Solutions
Data Science Related Questions
- JavaScript Full stack web developer virtual internship Virtual Bootcamp + Internship at LaimoonAED 1,449Duration: Upto 30 Hours
- GDPR : 7 Professional Bundle JanetsSAR 236Duration: Upto 12 Weeks
- SAR 56Duration: Upto 6 Hours