Data Integration ETL with Talend Open Studio Zero to Hero
What you’ll learn
connect your data sources, such as files, databases, XML, web services, Google Drive and more formats
build your own integration processes using practical examples and comprehensive scenarios
master the most important transformations like mappings, joins, aggregations and sorting
orchestrate processes into larger units by using preJobs, postJobs, variable and hierachies
Requirements
Interest in data and in bringing it together
Computer/Laptop with 4+GB RAM and current Java Runtime
Description
Data. Everywhere. All well-behaved in their own environment. But who actually lets them talk to each other? You do. With data integration. Become a data savant and add value with ETL and your new knowledge!Talend Open Studio is an open, flexible data integration solution. You build your processes with a graphical editor and over 600 components provide flexibility.Each section has a practical example and you will receive this complete material at the beginning of the course. So you can not only view each section, but also compare it to your own solution. There are also extensive practical scenarios included. So you’ll be well equipped for practice!What are the biggest topics you can expect?Installation on different operating systems (Windows, Linux, Mac)understanding and using important data typesreading and writing from databasesprocess different file formats, like Excel, XML, JSON, delimited, positionalcreate and use metadatabuild schemasuse helpful keyboard shortcutsretrieve data from WebServices / RESTconnect to GoogleDrive and fetch datausing iteration and loopsconvert data flows into iterationsbuild and understand job hierarchiesAll major transformations: Map, join, normalize, pivot, and aggregate datacreate and extract XML and JSONuse regular expressionsOrchestrate components in processesCheck and improve data qualityUse fuzzy matching and interval matchingUse variables for different environmentsPerform schema validationHandle reject data separatelyFind and fix errors quicklyWrite meaningful logsInclude and react to warnings and abortsBuild job hierarchies and pass data between different levelsimplement and test your own assumptionsconfigure your project for logging, versioning and context loadinglearn best practices and establish your owndocument items and have documentation generatedWhat are you waiting for? See you in the course!
Overview
Section 1: Course Overview
Lecture 1 Course Overview
Lecture 2 Udemy Tipps
Section 2: Why Data Integration?
Lecture 3 Why Data Integration?
Section 3: Setup for the course
Lecture 4 Intro
Lecture 5 Installation on Windows, Mac or Linux
Lecture 6 Language of Talend Open Studio
Lecture 7 Import course project
Lecture 8 Import the course project for version 7.3.1
Lecture 9 Download course files
Lecture 10 Explore the course project
Section 4: “Hello world” example
Lecture 11 Intro
Lecture 12 “Hello world” with tMsgBox
Section 5: Get to know the User Interface (UI)
Lecture 13 Intro
Lecture 14 Get to know the user interface
Lecture 15 Useful helpers
Lecture 16 Install external libraries
Lecture 17 Find errors
Lecture 18 Where is my tab?
Section 6: Your first ETL job
Lecture 19 Intro
Lecture 20 Your first job
Lecture 21 Get Component Help
Section 7: File Processing Basics
Lecture 22 Intro
Lecture 23 Read files
Lecture 24 Write files
Lecture 25 List folder contents
Lecture 26 File processing scenario
Section 8: Understand properties in Talend
Lecture 27 Intro
Lecture 28 BuiltIn vs. Repository
Lecture 29 Schema
Lecture 30 Data types
Lecture 31 Connection types
Lecture 32 Ctrl + Spacebar
Section 9: Process relational databases
Lecture 33 Intro
Lecture 34 Your database
Lecture 35 Connect to a database
Lecture 36 Write to a database
Lecture 37 Read from a database
Lecture 38 Iterate over database contents
Lecture 39 Database processing Scenario
Section 10: Process other formats: Cloud, JSON, REST, …
Lecture 40 Intro
Lecture 41 Process JSON
Lecture 42 Process XML
Lecture 43 Google Drive access
Lecture 44 Use Google Cloud Storage
Lecture 45 Query REST service
Lecture 46 File downloads
Lecture 47 Read RSS feeds
Section 11: Use variables in jobs
Lecture 48 Intro
Lecture 49 Create context variables
Lecture 50 Create environments for variables
Lecture 51 Data types in context
Lecture 52 Central context
Lecture 53 Create context for metadata
Lecture 54 Write and load context
Lecture 55 Load context implicitly
Lecture 56 Global variables
Section 12: Transformation components & usage
Lecture 57 Intro
Lecture 58 Filter data
Lecture 59 Sort data
Lecture 60 Aggregate data
Lecture 61 Convert data types
Lecture 62 Split data
Lecture 63 De-Normalization
Lecture 64 Simple joins
Lecture 65 Install Sakila
Lecture 66 Mapping with Join
Lecture 67 Mapping with filter
Lecture 68 Mapping with transformation
Lecture 69 Field extraction: Intro
Lecture 70 Field extraction: JSON
Lecture 71 Field extraction: XML
Lecture 72 Field extraction: Positional
Lecture 73 Field extraction: Delimited
Lecture 74 Field extraction: Regex
Lecture 75 Field extraction: Outro
Lecture 76 Generate XML and JSON fields
Lecture 77 Generate sample data
Lecture 78 Pivot data
Section 13: Data quality components
Lecture 79 Intro
Lecture 80 Remove duplicates
Lecture 81 Match intervals
Lecture 82 Make substitutions
Lecture 83 Schema checking
Lecture 84 Redundancy key
Lecture 85 Fuzzy matching
Section 14: File Management operations
Lecture 86 Intro
Lecture 87 Basic operations
Lecture 88 File comparison
Lecture 89 File properties
Lecture 90 File lists
Lecture 91 File compression
Lecture 92 Temporary files
Section 15: Job orchestration like hierarchies and subjobs
Lecture 93 Intro
Lecture 94 Pre- & Post-Job
Lecture 95 MessageBox
Lecture 96 Replicate and unite
Lecture 97 Data flow to iteration & vice versa
Lecture 98 Loops
Lecture 99 Wait and measure
Lecture 100 System interaction
Lecture 101 Job hierarchies
Lecture 102 Subjob Triggers
Lecture 103 Component Triggers
Lecture 104 Conditional Triggers
Section 16: Logging configuration and components
Lecture 105 Intro
Lecture 106 Debug data
Lecture 107 Test assertions
Lecture 108 Log volumes
Lecture 109 Log errors & warnings
Lecture 110 Log executions
Lecture 111 Log jobs
Lecture 112 Log jobs better
Lecture 113 Log projects
Section 17: Documentation & Best Practices
Lecture 114 Intro
Lecture 115 Use best practices
Lecture 116 Create documentation
Section 18: Job Deployment outside Talend Open Studio
Lecture 117 Intro
Lecture 118 Create and execute job code
Lecture 119 Change parameters
Section 19: Project Handling
Lecture 120 Intro
Lecture 121 Create and delete projects
Lecture 122 Export and Import
Lecture 123 Project Settings
Section 20: Use Cases
Lecture 124 Intro
Lecture 125 Write a Excel file
Lecture 126 Read a multi-schema XML
Lecture 127 Append to a text file
Lecture 128 Sync Tables Using Talend
Section 21: Course conclusion
Lecture 129 Congratulations
Section 22: Extra Material
Lecture 130 Talend Cheatsheet
Lecture 131 A small test
Lecture 132 A big test
Lecture 133 Template job
Lecture 134 Bonus Lecture
you want to bring together different data sets quickly and easily,you have challenges to let your data talk to each other,you are interested in a career in data space,future ETL developers
Course Information:
Udemy | English | 8h 3m | 3.18 GB
Created by: Samuel Lenk | Certified Talend Java SQL Data Integration ETL Developer
You Can See More Courses in the Developer >> Greetings from CourseDown.com