How to read data from an Excel file with Talend?

Would you like to explore a tool that can make it easier to interpret and simplify your daily data? Here is it…

Talend is a modern data management platform that offers a comprehensive suite of capabilities to handle data and drive business value efficiently.

It works with any data environment or framework and supports cloud, multi-cloud, hybrid, or on-premises environments.

Talend offers a modular solution accommodating data needs at any scale or complexity. Their key features include data integration, quality, governance, and application and API integration. Talend is flexible, scalable, and cloud-independent, supporting various architectures.

With Talend, organizations can turn their data into a trusted asset and tackle key initiatives for strategic business outcomes.

We will see the intricate yet powerful process of reading data using Talend, and we’ll navigate through key components that play a pivotal role in this process of reading data from an Excel file for effective logging and monitoring.

Talend offers Input components for the data reading, for example, TFileInputExcel, for an Excel Source. This Component lets us configure the file, define sheets, and set the mapping of the columns.

You can use diverse Excel formats, providing flexibility and manipulation for the data structures.

Step 1: Create Connection

→ In Repository, find the metadata and then create the connection that you need for the job (ex. We have an Excel file – we use File Excel for the connection)

Step 2: Create the Job

→ In the Repository, find Job Designs and create a new job with the desired name

Step 3: Add tFileInputExcel Component

→ Create the tFileInputExcel component

Double-click on tFileInputExcel to open its Component view

→ Set the file path, and sheet name, and configure other options based on your Excel file structure

→ Define the schema by clicking on the “Edit Schema” button. This step is crucial to define the structure of the data that will be read

Step 4: Add tMap Component

→ Create the tMap component

→ Connect the output of tFileInputExcel to the input of tMap

→ Double-click on tMap to open its Mapping Editor

→ In the Mapping Editor, you’ll see input and output tables

Map the columns from the input (tFileInputExcel) to the desired columns in the output

→ Apply any transformations or calculations if needed

→ Define the output schema by clicking on the “Edit Schema” button

Step 5: Add tLongRow Component

→ Create the tLogRow component

→ Connect the output of tMap to the input of tLogRow

→ Configure tLogRow

→ Double-click on tLogRow to open its Component view

→ Select the desired schema and then the mode you need when you read the data

Step 6: Run the Job

→ Save your job

→ Click on the Run button to execute the job

→ Open the console or Run view to see the output logged by tLogRow

 

Here is the result in Talend:

Watch the video tutorial here:

About btProvider:

We hold all possible technical and sales certifications for all products:

Talend, Tableau Desktop, Tableau Public, Tableau Server, Tableau Prep, Tableau Data Management, Vertica, Salesforce, Mulesoft, Write-Back.

With Talend – the best data management platform – you can always understand your data better. Using Talend, you will make intelligent and strategic decisions and seamlessly integrate, assure quality, and govern your data effortlessly. See other #skillpill video tutorials here:

How to create a Barcode Chart

How to create a top N Set

Using measure names in parameter actions

Are you interested in learning more about Talend – the catalyst for efficient and comprehensive data management solutions across your company.

Contact us!