The following quiz provides multiple-choice questions (MCQs) related to Sqoop.
Q 1 - To run sqoop from multiple nodes, it has to be installed in
Once Sqoop is installed on one node, it is automatically replicated to the other nodes in the cluster.
Q 2 - What are the two different incremental modes for importing data with Sqoop?
The --incremental parameter is used to fetch only the new data (data that does not already exist in Hadoop). In append mode, Sqoop checks the column named with --check-column and imports only the rows that are new since the last import. In lastmodified mode, it uses a timestamp column, such as last_updated_date in the source table, to identify new or updated rows.
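As a minimal sketch of the two modes, the Python snippet below assembles (but does not run) the argument list for an incremental `sqoop import`. The connection string, table name, target directory, and last values are hypothetical placeholders; `--incremental`, `--check-column`, and `--last-value` are the Sqoop options being illustrated.

```python
import shlex

def incremental_import_args(connect, table, target_dir, mode, check_column, last_value):
    """Assemble (without running) a `sqoop import` argument list for an incremental import."""
    if mode not in ("append", "lastmodified"):
        raise ValueError("mode must be 'append' or 'lastmodified'")
    return [
        "sqoop", "import",
        "--connect", connect,
        "--table", table,
        "--target-dir", target_dir,
        "--incremental", mode,            # append or lastmodified
        "--check-column", check_column,   # column examined to find new rows
        "--last-value", str(last_value),  # upper bound of the previous import
    ]

# Hypothetical values, for illustration only.
append_cmd = incremental_import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders",
    "append", "id", 1000)
modified_cmd = incremental_import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders",
    "lastmodified", "last_updated_date", "2024-01-01 00:00:00")

print(shlex.join(append_cmd))
print(shlex.join(modified_cmd))
```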
Q 3 - What is achieved by using the --meta-connect parameter in a sqoop command?
The --meta-connect parameter points the Sqoop client at a metastore that is running as a service, by default on port 16000. Jobs saved in this metastore service are then accessible throughout the cluster.
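A minimal sketch of how a client might address such a shared metastore is shown below. It only builds the argument list for `sqoop job`; the host name and job name are hypothetical, and the JDBC URI follows the HSQLDB form used by the Sqoop metastore with the default port 16000.

```python
import shlex

def metastore_uri(host, port=16000):
    """JDBC URI of a shared Sqoop metastore (16000 is the default port)."""
    return f"jdbc:hsqldb:hsql://{host}:{port}/sqoop"

def job_args(host, action, job_name=None):
    """Assemble (without running) a `sqoop job` call against a shared metastore."""
    args = ["sqoop", "job", "--meta-connect", metastore_uri(host)]
    if action == "list":
        args.append("--list")
    elif action == "exec":
        if not job_name:
            raise ValueError("job_name is required for 'exec'")
        args += ["--exec", job_name]
    else:
        raise ValueError("action must be 'list' or 'exec'")
    return args

# Hypothetical host and job name.
list_cmd = job_args("metastore.example.com", "list")
exec_cmd = job_args("metastore.example.com", "exec", "nightly_orders")
print(shlex.join(list_cmd))
print(shlex.join(exec_cmd))
```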
Q 4 - While inserting data into a relational system from Hadoop using Sqoop, the various table constraints present in the relational table must be
We must verify that the data being exported does not violate the table's constraints.
Q 5 - Using a higher value for the parameter sqoop.export.statements.per.transaction will
When the database requires a table-level write lock, a higher value of sqoop.export.statements.per.transaction locks the table for a longer time and decreases performance.
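The property is passed as a generic `-D` option placed right after the tool name. The sketch below assembles such a `sqoop export` argument list without running it; the connection string, table, export directory, and the value 10 are hypothetical and not tuning advice.

```python
import shlex

def export_args(connect, table, export_dir, statements_per_transaction):
    """Assemble (without running) a `sqoop export` argument list."""
    if statements_per_transaction < 1:
        raise ValueError("statements_per_transaction must be at least 1")
    return [
        "sqoop", "export",
        # Generic -D options must precede the tool-specific arguments.
        f"-Dsqoop.export.statements.per.transaction={statements_per_transaction}",
        "--connect", connect,
        "--table", table,
        "--export-dir", export_dir,
    ]

# Hypothetical values; a smaller number means shorter transactions.
export_cmd = export_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders", 10)
print(shlex.join(export_cmd))
```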
Q 6 - Can the upsert feature of Sqoop delete some data from the exported table?
Sqoop never deletes data as part of an upsert statement.
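For context, an upsert export is requested with `--update-key` together with `--update-mode allowinsert`: existing rows are updated and missing rows are inserted, and nothing is deleted. The sketch below only builds the argument list; the connection string, table, directory, and key column are hypothetical, and whether `allowinsert` is supported depends on the database connector.

```python
import shlex

def upsert_export_args(connect, table, export_dir, update_key):
    """Assemble (without running) a `sqoop export` call in upsert mode."""
    return [
        "sqoop", "export",
        "--connect", connect,
        "--table", table,
        "--export-dir", export_dir,
        "--update-key", update_key,      # column(s) used to match existing rows
        "--update-mode", "allowinsert",  # update matches, insert the rest
    ]

# Hypothetical values, for illustration only.
upsert_cmd = upsert_export_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders", "id")
print(shlex.join(upsert_cmd))
```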
Q 7 - If the table to which data is being exported has more columns than the data present in the HDFS file then
The load can still be done by specifying the --columns parameter to populate a subset of columns in the relational table.
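A minimal sketch of such an export follows. It assembles the argument list only; the table and column names are hypothetical, and the columns left out of `--columns` would need a default value or allow NULL in the target table.

```python
import shlex

def subset_export_args(connect, table, export_dir, columns):
    """Assemble (without running) a `sqoop export` that fills only some columns."""
    if not columns:
        raise ValueError("at least one column is required")
    return [
        "sqoop", "export",
        "--connect", connect,
        "--table", table,
        "--export-dir", export_dir,
        # Comma-separated list, in the order the fields appear in the HDFS file.
        "--columns", ",".join(columns),
    ]

# Hypothetical table with more columns than the HDFS data provides.
subset_cmd = subset_export_args(
    "jdbc:mysql://db.example.com/shop", "customers", "/data/customers",
    ["id", "name", "email"])
print(shlex.join(subset_cmd))
```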
Q 8 - If the target Hive table is partitioned then Sqoop behavior is which of the following?
Sqoop supports loading into Hive partitions using additional parameters in the sqoop command, namely --hive-partition-key and --hive-partition-value.
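The sketch below shows how the Hive partition options might be combined in one import. It builds the argument list without running it; the connection string, table names, and the partition key and value are hypothetical.

```python
import shlex

def hive_partition_import_args(connect, table, hive_table, part_key, part_value):
    """Assemble (without running) a `sqoop import` into one Hive partition."""
    return [
        "sqoop", "import",
        "--connect", connect,
        "--table", table,
        "--hive-import",
        "--hive-table", hive_table,
        "--hive-partition-key", part_key,      # name of the partition column
        "--hive-partition-value", part_value,  # partition this import loads into
    ]

# Hypothetical values, for illustration only.
hive_cmd = hive_partition_import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "orders_hive",
    "load_date", "2024-01-01")
print(shlex.join(hive_cmd))
```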
Q 9 - The parameter used to identify the individual row in HBase while importing data to it using sqoop is
The parameter --hbase-row-key is used in Sqoop to identify each row in the HBase table.
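As an illustration, the sketch below assembles an HBase import in which a source column supplies the row key. Nothing is executed; the connection string, table names, column family, and key column are hypothetical.

```python
import shlex

def hbase_import_args(connect, table, hbase_table, column_family, row_key):
    """Assemble (without running) a `sqoop import` into an HBase table."""
    return [
        "sqoop", "import",
        "--connect", connect,
        "--table", table,
        "--hbase-table", hbase_table,
        "--column-family", column_family,
        "--hbase-row-key", row_key,  # source column used as the HBase row key
    ]

# Hypothetical values, for illustration only.
hbase_cmd = hbase_import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "orders_hbase", "d", "id")
print(shlex.join(hbase_cmd))
```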
Q 10 - The comparison of row counts between the source system and the target database while loading the data using sqoop is done using the parameter
The --validate parameter is used to show the result of the row count comparison between source and target.
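A minimal sketch of switching validation on is given below. The helper appends `--validate` to an otherwise ordinary single-table import; the connection string, table, and target directory are hypothetical, and the list is only assembled, not executed.

```python
import shlex

def import_args(connect, table, target_dir, validate=False):
    """Assemble (without running) a `sqoop import`, optionally with validation."""
    args = [
        "sqoop", "import",
        "--connect", connect,
        "--table", table,
        "--target-dir", target_dir,
    ]
    if validate:
        # Compare source and target row counts after the copy.
        args.append("--validate")
    return args

# Hypothetical values, for illustration only.
validated_cmd = import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders", validate=True)
plain_cmd = import_args(
    "jdbc:mysql://db.example.com/shop", "orders", "/data/orders")
print(shlex.join(validated_cmd))
```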