This section includes InterviewSolutions questions on Sqoop, curated to sharpen your knowledge and support exam preparation. Choose a question below to get started.
| 1. |
Where can the metastore database be hosted? |
| Answer» | |
| 2. |
Which database does the Sqoop metastore run on? |
| Answer» | |
| 3. |
Give the Sqoop command to see the contents of the job named myjob. |
| Answer» | |
| 4. |
How can you see the list of stored jobs in sqoop metastore? |
| Answer» | |
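For the two stored-job questions above, a minimal shell sketch of the `sqoop job` tool follows. It only prints the commands instead of running them, so it can be reviewed on a machine without Sqoop. The variable names are illustrative, and a shared metastore would additionally need a `--meta-connect` argument, which is left out here.

```shell
# Name of the saved job, taken from the question above.
JOB_NAME="myjob"

# List every job stored in the metastore.
LIST_CMD="sqoop job --list"

# Show the saved definition (tool and arguments) of one job.
SHOW_CMD="sqoop job --show ${JOB_NAME}"

# Print the commands for review; run them directly on a host with Sqoop installed.
echo "$LIST_CMD"
echo "$SHOW_CMD"
```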
| 5. |
What is the purpose of sqoop-merge? |
| Answer» | |
| 6. |
What is a sqoop metastore? |
| Answer» | |
| 7. |
Give a command to execute a stored procedure named proc1 which exports data to a MySQL DB named DB1 from an HDFS directory named Dir1. |
| Answer» | |
| 8. |
Give a Sqoop command to import data from all tables in the MySQL DB DB1. |
| Answer» | |
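One way to approach the all-tables import asked about above is the `import-all-tables` tool. The sketch below assumes a MySQL server on `localhost` and a user called `sqoop_user`; both are placeholders, not values from the question. It prints the command rather than executing it.

```shell
# Hypothetical connection details; only the database name DB1 comes from the question.
DB_HOST="localhost"
DB_USER="sqoop_user"
DB_NAME="DB1"

# import-all-tables imports every table in the database in one invocation.
# -P makes Sqoop prompt for the password instead of taking it on the command line.
IMPORT_ALL_CMD="sqoop import-all-tables --connect jdbc:mysql://${DB_HOST}/${DB_NAME} --username ${DB_USER} -P"

# Print the command for review; run it on a host with Sqoop installed.
echo "$IMPORT_ALL_CMD"
```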
| 9. |
Give a Sqoop command to import all the records from the employee table, divided into groups of records by the values in the column department_id. |
| Answer» | |
| 10. |
What does the following query do? |
| Answer» | |
| 11. |
Give a Sqoop command to run only 8 MapReduce tasks in parallel. |
| Answer» | |
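The degree of parallelism asked about above is usually set with `--num-mappers` (short form `-m`). The following is a sketch only: the host, user, database, and table names are assumptions chosen for illustration, and the command is printed rather than run.

```shell
# Hypothetical connection and table details, not taken from the question.
DB_HOST="localhost"
DB_USER="sqoop_user"
DB_NAME="DB1"
TABLE_NAME="employee"

# Number of parallel map tasks requested by the question.
NUM_MAPPERS=8

# --num-mappers asks Sqoop to use up to this many map tasks for the import.
PARALLEL_CMD="sqoop import --connect jdbc:mysql://${DB_HOST}/${DB_NAME} --username ${DB_USER} -P --table ${TABLE_NAME} --num-mappers ${NUM_MAPPERS}"

# Print the command for review; run it on a host with Sqoop installed.
echo "$PARALLEL_CMD"
```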
| 12. |
Give a Sqoop command to import the columns employee_id, first_name, and last_name from the MySQL table Employee. |
| Answer» | |
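A column-restricted import like the one asked about above can be sketched with the `--columns` argument. The table and column names come from the question; the host, user, and database name are placeholders assumed for illustration. The command is printed rather than executed.

```shell
# Hypothetical connection details; the question does not name the server or database.
DB_HOST="localhost"
DB_USER="sqoop_user"
DB_NAME="DB1"

# Table and columns named in the question.
TABLE_NAME="Employee"
COLUMNS="employee_id,first_name,last_name"

# --columns takes a comma-separated list and limits the import to those columns.
COLUMNS_CMD="sqoop import --connect jdbc:mysql://${DB_HOST}/${DB_NAME} --username ${DB_USER} -P --table ${TABLE_NAME} --columns ${COLUMNS}"

# Print the command for review; run it on a host with Sqoop installed.
echo "$COLUMNS_CMD"
```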
| 13. |
What are the two file formats supported by sqoop for import? |
| Answer» | |
| 14. |
How can you import only the updated rows from a table into HDFS using Sqoop, assuming the source has a last-update timestamp for each row? |
| Answer» | |
| 15. |
How can you control the mapping between SQL data types and Java types? |
| Answer» | |
| 16. |
What happens when a table is imported into an HDFS directory which already exists using the --append parameter? |
| Answer» | |
| 17. |
What does this sqoop command achieve? |
| Answer» | |
| 18. |
What is the importance of --split-by clause in running parallel import tasks in sqoop? |
| Answer» | |
| 19. |
In a Sqoop import command you asked for 8 parallel MapReduce tasks, but Sqoop runs only 4. What can be the reason? |
| Answer» | |
| 20. |
How can you force Sqoop to execute a free-form SQL query only once and import the rows serially? |
| Answer» | |
| 21. |
What do you mean by Free Form Import in Sqoop? |
| Answer» | |
| 22. |
Give a Sqoop command to show all the databases in a MySQL server. |
| Answer» | |
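For the database-listing question above, the `list-databases` tool is the usual starting point. This sketch assumes a MySQL server on `localhost` and a user called `sqoop_user`, both placeholders, and prints the command instead of running it.

```shell
# Hypothetical connection details; replace with the real server and account.
DB_HOST="localhost"
DB_USER="sqoop_user"

# list-databases connects to the server (no database name in the URL) and lists its schemas.
# -P makes Sqoop prompt for the password on the console.
LIST_DB_CMD="sqoop list-databases --connect jdbc:mysql://${DB_HOST}/ --username ${DB_USER} -P"

# Print the command for review; run it on a host with Sqoop installed.
echo "$LIST_DB_CMD"
```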
| 23. |
Sqoop imported a table successfully to HBase but it is found that the number of rows is fewer than expected. What can be the cause? |
| Answer» | |
| 24. |
How can you schedule a sqoop job using Oozie? |
| Answer» | |
| 25. |
How can we load to a column in a relational table which is not null but the incoming value from HDFS has a null value? |
| Answer» | |
| 26. |
How can you export only a subset of columns to a relational table using sqoop? |
| Answer» | |
| 27. |
How can you sync an exported table with HDFS data in which some rows are deleted? |
| Answer» | |
| 28. |
How will you update the rows that are already exported? |
| Answer» | |
| 29. |
How do you clear the data in a staging table before loading it by Sqoop? |
| Answer» | |
| 30. |
How will you implement all-or-nothing load using sqoop? |
| Answer» | |
| 31. |
What is the difference between the parameters sqoop.export.records.per.statement and sqoop.export.statements.per.transaction? |
| Answer» | |
| 32. |
Before starting the data transfer with a MapReduce job, Sqoop takes a long time to retrieve the minimum and maximum values of the column given in the --split-by parameter. How can we make this more efficient? |
| Answer» | |
| 33. |
How can you choose a name for the mapreduce job which is created on submitting a free-form query import? |
| Answer» | |
| 34. |
How can we slice the data to be imported to multiple parallel tasks? |
| Answer» | |
| 35. |
How do you fetch data which is the result of join between two tables? |
| Answer» | |
| 36. |
Is it possible to add a parameter while running a saved job? |
| Answer» | |
| 37. |
What is the usefulness of the options file in Sqoop? |
| Answer» | |
| 38. |
When the source data keeps getting updated frequently, what is the approach to keep it in sync with the data in HDFS imported by sqoop? |
| Answer» | |
| 39. |
How can you avoid importing tables one-by-one when importing a large number of tables from a database? |
| Answer» | |
| 40. |
How can you control the number of mappers used by the sqoop command? |
| Answer» | |
| 41. |
What is a disadvantage of using --direct parameter for faster data load by sqoop? |
| Answer» | |
| 42. |
What is the significance of using --compress-codec parameter? |
| Answer» | |
| 43. |
What is the default extension of the files produced from a sqoop import using the --compress parameter? |
| Answer» | |
| 44. |
What is the advantage of using --password-file rather than -P option while preventing the display of password in the sqoop import statement? |
| Answer» | |
| 45. |
How can we import a subset of rows from a table without using the where clause? |
| Answer» | |
| 46. |
How can you import only a subset of rows from a table? |
| Answer» | |
| 47. |
When to use --target-dir and when to use --warehouse-dir while importing data? |
| Answer» | |
| 48. |
Is a JDBC driver enough to connect Sqoop to the databases? |
| Answer» | |
| 49. |
What is the role of the JDBC driver in a Sqoop setup? |
| Answer» | |