Course Details
- Category
- Data Warehousing(ETL)
- Mode of Training
- Online/Offline
- Duration
- 3 - 4 months
- Fees
- ₹ 2000
Course Features
Instructor led Sessions
Real-life Case Studies
Assignment
Certification
Instructor led Sessions
Talend Training
Oranium Tech introducing some amazing content on Talend. Talend is an ETL tool for Data Integration. It provides software solutions for data preparation, data quality, data integration, application integration, data management and big data. Talend has a separate product for all these solutions. Data integration and big data products are widely used.
Course Syllabus
• www.dw-learnwell.com
• SQL
• Hadoop/Big Data
• Data Analytics – R programming/Python
• Studio Definitions
• Starting the studio
• Configuring your own Talend View
• Creating the project
• Creating an example job
• Opening/Creating a Business Model
• Modelling a Business Model
• Assigning Repository elements to a Business Model
• Creating a Built-in Schema
• Propagating Schema Changes
• Creating a generic schema from existing metadata
• Dropping Schema to empty components
• Cutting and pasting schema information
• Overview of tMap
• Connecting multiple inputs to tMap
• Creating joins in tMap
• Catching Rejects in tMap
• Using Expression Builder in tMap
• Using Variables in tMap
• Performance tuning of look ups in tMap
• Classes and Objects
• Writing Hello Java Program
• OOPS Concepts
• Basics of Java
• Method overloading
• Static keyword
• Abstract Modifiers
• Handling Java Strings
• Creating Routines in Talend
• Using Custom Code in Talend
• Using tJava component
• Using tJavaRow component
• Using tJavaFlex component
• tDBConnection
• tDBInput
• tDBoutput
• tDBRow
• tMySqlColumnList
• tFileList
• tFlowtoIterate
• tIteratetoFlow
• tUniqRow
• tReplaceList
• tSchemaComplianceCheck
• tJoin
• tFilterRow
• tAggregateRow
• tSortRow
• tReplace
• tAggregateSortedRow
• tNormalize
• tDeNormalize
• Using tJavaFlex
• Capturing Statistics at Project Level
• Using tWarn, tDie, tFlowMeter
• Using tFlowMeterCatcher, tLogcatcher
• Using Global Variables in Talend
• Creating Context variables
• Creating Context group
• Usage of Prompts in Talend Jobs
• Running Jobs in different contexts
• Using tContextLoad
• Using tContextDump
• Using tRunJob
• Passing parameters to Sub Job
• Passing context variables from Parent Job to Child Job
• Passing context variables from Child Job to Parent Job
• Creating a task in Job Conductor
• Creating execution plan in TAC
• Performance Tuning in tMap
• Performance Tuning using Subjobs
• Using ELT Components
• Adding Break points
• Debug Run in Talend Studio
• Adding Break points
• Debug Run in Talend Studio
1. Subject -Oriented
2. Integrated
3. 3Non – Volatile
4. 4Time Varying
1. Data Source Layer
2. Data Extraction Layer
3. Staging Layer
4. ETL Layer
5. Data Storage Layer
6. Data Logic Layer
7. Data Presentation Layer
8. Metadata Layer
9. System Operation Layer
1. Additive Facts
2. Semi Additive Facts
3. Non – Additive Fact
4. Cumulative
5. Snapshot
1. Star Schema
2. Snow Flake Schema
3. Fact Constellation Schema
4. Slow Changing Dimension
1. SCD1 – Advantages/ Disadvantages
2. SCD2 – Advantages/ Disadvantages
3. SCD3 – Advantages/ Disadvantages
• Difference between OLAP and OLTP
Types Of OLAP
1. Multi-Dimensional (MOLAP)
2. Relational(ROLAP)
3. Hybrid(HOLAP)