• LOGIN
  • No products in the cart.

Top 25 Talend Data Quality Interview Questions and Answers in 2020 – GoLogica

What do you mean by Talend?

Talend open studio is the open source data integration product produced by Talend and it is designed to convert, combine and update data in various areas across a business.

What is the significance of tLoqateAddressRow component in Talend?

It is a component for mailing correct address belongs to the respective customer data to make sure a single customer view and good delivery for their respective customer mailing.

Differentiate between Repository and Built-In?

In Built-In we can manually edit the data as data is kept locally in the job whereas in repository all the data is stored there only. We can extract only Read-only-information into the job from repository.

What’s new in v5.6?

This technical note highlights the important new features and capabilities of version 5.6 of Talend’s comprehensive suite of Platform, Enterprise and Open Studio solutions.

With version 5.6 Talend:

  • Extends it big data leadership position enabling firms to move beyond batch processing and into real-time big data by providing technical previews of the Apache Spark, Apache Spark Streaming and Apache Storm frameworks.
  • Enhances its support for the Internet of Things (loT) by introducing support for key loT protocols (MQTT, AMQP) to gather and collect information from machines, sensors, or other devices.
  • Improves Big Dta performance: Map Reduce executes on average 24% faster in v5.6 and 53% faster than in v5.4, while Big Data profiling performance is typically 20 times faster in v5.6 compared to v5.5.
  • Enables faster updates to MDM data models and provides deeper control of data lineage, more visibility and control.
  • Offers further enterprise application connectivity and support by continuing to add to its extensive list of over 800 connectors and components with enhanced support for enterprise applications such as SAP BAPI and Tables, Oracle 12 GoldenGate CDC, Microsoft HDInsight, Marketo and Salesforce.com

What are the operations which tMap allows?

  • data transformation on any type of fields
  • data multiplexing and demultiplexing
  • fields concatenation and interchange
  • data rejecting
  • field filtering using constraints

What is tJoin?

tJoin joins two tables by doing an exact match on several columns. It compares columns from the main flow with reference columns from the lookup flows and outputs the main flow data and/or the rejected data.

Can we use Binary Transfer mode in SFTP?

No, in SFTP we cannot use the binary transfer mode because SFTP is not like the FTP. Hence, we cannot apply the concept like ‘current mode directory’ and ‘transfer mode’.

Differentiate between ‘insert or update’ and ‘update or insert’.

  • Insert or update means first we insert a record, but if a record is matching with the primary key then the record is updated.
  • Update or insert means first we update the record with same primary and if the record doesn’t exists then we insert the record.

Differentiate between OnComponentOk and OnSubjectOk ?

They both are trigger links which can connect to another subject job. The major difference between both of them is that they both lies in the execution order of the connected subjobs.

Define tMap?

tMap is the latest component which simply converts and routes data from one or many sources to one or many destinations.

Difference between tjava, tjavarow and tjavaflex component in talend?

  • tjava – execute only once
  • tjavarow – execute for each row
  • tjavaflex – if we want to perform specific task at certain time for that row. tjavaflex contains three sections: Start (code written here will execute before processing any row), Main (code will execute for each row), End (code will execute after processing every row).

List the usage of tmap component?

  • map the data from one or more sources to one or more destination.
  • Data transformation
  • Filtering of data
  • Rejection of data

How can we execute job on remote server in local Talend Studio?

Firstly configure remote server – Go to Window, -> Preferences ->Talend ->Run/Debug -> Remote -> Add remote server details.

And Now to execute the job on remote server  go to Run job view -> Target Exec tab -> select the remote server and then run the job.

Why ‘GC overhead limit exceeded’ error occurs?

Error occurs when Garbage collector memory is fully utilized and no more memory left to process further. Generally in talend we will see such scenarios when job handling huge lookup data. Increase in -Xmx jvm argument will help in solving the issue.

How can you store job logs using AMC?

We can activity AMC by enabling user errors,warning and run time error in stats and log tab under job window.either we can log the entries in DataBase or Files or Console.

Talend data quality training

Null handling in Talend?

There are 90% chances of job failure due to Null handling exception. In order to avoid the exception, it is advisable to check column for null value and then do transformations on that column.

Handling Hierarchical XML file and extracting data?

We can use tfileMSXMLinput, tfileMSXMLoutput components to extract data or create XML structure from Hierarchical XML having multiple loops at different level.

What is context variables?

Variables having different values as per the environments. We define values for variables used in a job in DEV or PROD as context so that we can choose the values of these variables during run time without changing the job.

How many types of triggers present in TAC/Talend?

3 types of triggers we can use to schedule the job:

  • Normal trigger
  • Cron trigger
  • File trigger

Why use the Palette setting in talent?

The palette setting allows launching Talend Open Studio tool more quickly as only the current component is needed to load in the project.

What is mean by Routines in Talend?

Routines are complicated Java functions, which are used to factorize code. It helps to recover Job capacities and optimized data procedure.

What is Migration Task in Talend?

Migration Task ensures the fullness of a project which was developed using the older version of Talend.

Is it possible to change the background colour of job designer in Talend?

Yes, it is possible to design background by selecting the preferences of the window menus, then buy just clicking on the colour menu you can design background colour.

Benifits of using tMap over tJoin:

We can perform inner and outer join using tMap. Also we can filter data using tMap. Also we can create multiple outputs using tMap.

What is the meaning of MDM with reference to Talend?

MDM stands for Master Data Management. With the help of MDM organization can build and manage a single, consistent and accurate view of the enterprise data. MDM helps to increase business value by improving operational efficiency, marketing effectiveness, planning and regulatory compliance.

May 8, 2020
GoLogica Technologies Private Limited  © 2019. All rights reserved.