shaileshjadhavSS
Solved start test button issue and Added python, django, Data engineering and common questions
2b130d1
raw
history blame
9.4 kB
question,option1,option2,option3,option4,answer,difficulty
"What is ETL in data engineering?","Extract, Transform, Load","Extract, Train, Load","Execute, Transfer, Log","Extract, Transform, List","Extract, Transform, Load","low"
"What is a data pipeline?","A sequence of data processing steps","A storage system for data","A process for loading data into a database","A type of data visualization","A sequence of data processing steps","low"
"What is Apache Kafka used for in data engineering?","Data storage","Real-time data streaming","Data warehousing","Data cleaning","Real-time data streaming","low"
"What is a data lake?","A large storage repository for raw data","A SQL database","A machine learning model","A type of data transformation","A large storage repository for raw data","low"
"What is the purpose of a data warehouse?","Store raw data","Transform data for processing","Store processed data for analysis","Generate data reports","Store processed data for analysis","low"
"What does Apache Spark do?","Data storage","Real-time data processing","Data transformation","Distributed computing for processing large datasets","Distributed computing for processing large datasets","low"
"What is the difference between batch processing and stream processing?","Batch is for real-time, stream is for scheduled tasks","Batch processes data in chunks, stream processes data in real-time","Batch is used for databases, stream is used for files","Batch stores data, stream processes data","Batch processes data in chunks, stream processes data in real-time","low"
"What is Hadoop?","A data processing framework","A data visualization tool","A machine learning model","A database system","A data processing framework","low"
"What is a NoSQL database?","A non-relational database","A database with no data","A relational database with flexible schemas","A database for storing unstructured data","A non-relational database","low"
"Which of the following is a columnar database?","MySQL","PostgreSQL","HBase","MongoDB","HBase","medium"
"What is a schema in a database?","A list of data points","A blueprint of how data is organized","A query language","A data transformation tool","A blueprint of how data is organized","medium"
"What is Apache Airflow used for?","Data storage","Data processing","Orchestrating workflows","Data analysis","Orchestrating workflows","medium"
"What is a data model?","A way of organizing data for storage and analysis","A tool for transforming data","A type of database system","A reporting tool","A way of organizing data for storage and analysis","medium"
"What is the difference between OLTP and OLAP?","OLTP is for transaction processing, OLAP is for data analysis","OLTP is for data analytics, OLAP is for real-time data storage","OLTP processes data in real-time, OLAP stores data in batches","OLTP stores data, OLAP processes it","OLTP is for transaction processing, OLAP is for data analysis","medium"
"How does data normalization work?","Storing data in a compressed format","Breaking down data into smaller pieces for easier storage","Ensuring data is consistent and avoids redundancy","Converting data into an encrypted format","Ensuring data is consistent and avoids redundancy","medium"
"What is the purpose of indexing in a database?","To store data efficiently","To speed up data retrieval","To back up data","To validate data integrity","To speed up data retrieval","medium"
"What is data partitioning?","Dividing large datasets into smaller, manageable pieces","Transforming data for reporting","Storing data in a compressed format","Creating backups of data","Dividing large datasets into smaller, manageable pieces","medium"
"What is a relational database?","A database that stores data in tables with relationships","A database that stores raw data","A database that doesn’t use schemas","A database designed for unstructured data","A database that stores data in tables with relationships","medium"
"What is a foreign key in a database?","A unique identifier for a record","A field that links two tables together","A type of index","A tool for data normalization","A field that links two tables together","medium"
"What is the purpose of data cleaning?","To process raw data for storage","To remove inconsistencies and inaccuracies in data","To create reports from data","To back up data for future use","To remove inconsistencies and inaccuracies in data","high"
"What is Apache Flink used for?","Data storage","Data transformation","Real-time data stream processing","Batch processing","Real-time data stream processing","high"
"How does a data pipeline handle failures?","It retries until successful","It stops and alerts the user","It logs the error and proceeds with a backup plan","It silently skips the data","It logs the error and proceeds with a backup plan","high"
"What is the purpose of an API in data engineering?","To store and process data","To enable communication between systems","To encrypt data","To clean data","To enable communication between systems","high"
"What is data governance?","Managing the integrity, security, and availability of data","The process of cleaning data","A type of database schema","A method of storing data in the cloud","Managing the integrity, security, and availability of data","high"
"What is a relational algebra operation?","A way of transforming relational data into visual reports","A mathematical operation for querying databases","A tool for data modeling","A type of data cleaning process","A mathematical operation for querying databases","high"
"What is CDC (Change Data Capture)?","A data transformation process","A technique for identifying and capturing changes in data over time","A method for removing duplicate data","A data warehousing technique","A technique for identifying and capturing changes in data over time","high"
"How do you manage big data?","By using NoSQL databases only","By storing it in filesystems","By splitting it into smaller chunks for parallel processing","By using relational databases only","By splitting it into smaller chunks for parallel processing","high"
"What is MapReduce?","A way of storing data","A technique for distributed data processing","A tool for visualizing data","A data analysis method","A technique for distributed data processing","high"
"What is cloud computing in data engineering?","Using on-premises servers for data storage","Storing data in a centralized database","Storing and processing data using remote servers and services","Using external hard drives for backup","Storing and processing data using remote servers and services","high"
"What is data sharding?","Dividing a large dataset into smaller, distributed parts","Encrypting data","Storing data in multiple formats","A data transformation technique","Dividing a large dataset into smaller, distributed parts","high"
"How do you perform data transformation?","By using SQL queries only","By using ETL tools or scripts","By manually changing data in databases","By storing data in different formats","By using ETL tools or scripts","high"
"What is the output of: print(type([]))?","list","tuple","dict","set","list","low"
"How do you declare a variable in Python?","var x = 10","int x = 10","x = 10","declare x = 10","x = 10","low"
"What does the len() function do?","Returns length of a sequence","Returns a list","Converts string to list","Creates a dictionary","Returns length of a sequence","low"
"Which of the following is a valid Python data type?","integer","float","string","All of the above","All of the above","low"
"How do you define a function in Python?","fun my_function()","def my_function()","function my_function()","None of the above","def my_function()","low"
"What is the output of: print(3 * 'Python')?","PythonPythonPython","Error","Python3","None","PythonPythonPython","medium"
"Which method is used to add an element to a set?","add()","append()","insert()","extend()","add()","medium"
"What is the output of: print(5 // 2)?","2.5","2","3","Error","2","medium"
"How do you handle exceptions in Python?","try-catch","try-finally","try-except","try-else","try-except","medium"
"What is a Python decorator?","A function returning another function","A type of module","A Python class","A data structure","A function returning another function","high"
"How do you define a lambda function in Python?","lambda x: x * 2","def lambda x: x * 2","lambda x => x * 2","None of the above","lambda x: x * 2","high"
"What is the time complexity of accessing an element in a dictionary?","O(1)","O(n)","O(log n)","O(n^2)","O(1)","high"
"How do you create a virtual environment in Python?","python3 -m venv venv","virtualenv venv","venv create venv","None of the above","python3 -m venv venv","high"
"What is the output of: print('Python'[::-1])?","nohtyP","Python","Error","None","nohtyP","high"
"What is the purpose of the 'continue' statement in Python?","Exits the loop","Skips the current iteration","Pauses the loop","None of the above","Skips the current iteration","high"
"What does the map() function do in Python?","Applies a function to all items in an iterable","Combines two lists","Filters out elements from a list","None of the above","Applies a function to all items in an iterable","high"
"How do you check if a key exists in a dictionary?","key in dict","dict.has_key(key)","key in dict.keys()","None of the above","key in dict","high"