Analiza danych Certificate for Kamila Żur
Certificate ID:
754599
Authentication Code:
5f166
Certified Person Name:
Kamila Żur
Trainer Name:
Anna Kotarba
Duration Days:
3
Duration Hours:
21
Course Name:
Analiza danych
Course Date:
27 February 2024 09:00 to 29 February 2024 16:00
Course Outline:
- Big data fundamentals
- Big Data and its role in the corporate world
- The phases of development of a Big Data strategy within a corporation
- Explain the rationale underlying a holistic approach to Big Data
- Components needed in a Big Data Platform
- Big data storage solution
- Limits of Traditional Technologies
- Overview of database types
- The four dimensions of Big Data
- Big data impact on business
- Business importance of Big Data
- Challenges of extracting useful data
- Integrating Big data with traditional data
- Big data storage technologies
- Overview of big data technologies
- Data storage models
- Hadoop
- Hive
- Cassandra
- MongoDB
- Choosing the right big data technology
- Overview of big data technologies
- Processing big data
- Connecting and extracting data from database
- Transforming and preparation data for processing
- Using Hadoop MapReduce for processing distributed data
- Monitoring and executing Hadoop MapReduce jobs
- Hadoop distributed file system building blocks
- Mapreduce and Yarn
- Handling streaming data with Spark
- Big data analysis tools and technologies
- Programming Hadoop with Pig Latin language
- Querying big data with Hive
- Mining data with Mahout
- Visualizing and reporting tools
- Big data in business
- Managing and establishing Big Data needs
- Business importance of Big Data
- Selecting the right big data tools for the problem
Data Warehousing Concepts
- What is Data Ware House?
- Difference between OLTP and Data Ware Housing
- Data Acquisition
- Data Extraction
- Data Transformation.
- Data Loading
- Data Marts
- Dependent vs Independent data Mart
- Data Base design
ETL Testing Concepts:
- Introduction.
- Software development life cycle.
- Testing methodologies.
- ETL Testing Work Flow Process.
- ETL Testing Responsibilities in Data stage.
Big data Fundamentals
- Big Data and its role in the corporate world
- The phases of development of a Big Data strategy within a corporation
- Explain the rationale underlying a holistic approach to Big Data
- Components needed in a Big Data Platform
- Big data storage solution
- Limits of Traditional Technologies
- Overview of database types
NoSQL Databases
Hadoop
Map Reduce
Apache Spark
Course Name Eng:
Big Data - Data Science