1.Question 1Which of the following statements is true about a directed acyclic graph (DAG)? Select all that apply.1 pointIn Apache Spark, RDDs are represented by the verticesA tabular data structure with rows and columnsA new edge is obtained from an older vertexA data structure with edges and vertices2.Question 2Which function is applied to create a data set from a sequence?1 pointCreate()toDS()DSRdd()seqDS()3.Question 3Which of the following is a feature of Tungsten? 1 pointEnhances security by restricting developers from adding data source-specific rulesUtilizes CPU registers for storing intermediate dataDepends on the JVM object modelGenerates an optimized physical query plan from the logical query plan4.Question 4While adhering to best practices, in what order does a typical data engineer perform operations on Apache Spark?1 pointRead, analyze, transform, load, and write Analyze, read, transform, load, and writeRead, analyze, load, transform, and writeAnalyze, read, load, transform, and write5.Question 5Which data sources can be utilized with Apache Spark SQL?1 pointMongoDBCustom file formatsExternal APIsParquet files6.Question 6How can you create a Global Temporary view in Spark SQL?1 pointUse the createTempView function with a “Global” prefixUse the createView function with a “Global” prefix Use the createGlobalTempView function Use the createGlobalView function 7.Question 7Spark creates a Directed Acyclic Graph (DAG) during the creation of a Resilient Distributed Dataset (RDD).The DAG is associated with the new RDD.If there is an action, the driver program, which invokes calls the action, evaluates the DAG after Spark completes the action.The pointer responsible for transforming the RDD returns to the Spark driver program.Spark utilizes the DAG Scheduler to perform a transformation and updates the DAG accordingly.Select the sequence that explains RDD transformation and actions.1 point1 >> 3 >> 4 >> 5 >> 21 >> 2 >> 3 >> 4 >> 51 >> 5 >> 2 >> 4 >> 32 >> 3 >> 4 >> 1 >> 5
Question
1.Question 1Which of the following statements is true about a directed acyclic graph (DAG)? Select all that apply.1 pointIn Apache Spark, RDDs are represented by the verticesA tabular data structure with rows and columnsA new edge is obtained from an older vertexA data structure with edges and vertices2.Question 2Which function is applied to create a data set from a sequence?1 pointCreate()toDS()DSRdd()seqDS()3.Question 3Which of the following is a feature of Tungsten? 1 pointEnhances security by restricting developers from adding data source-specific rulesUtilizes CPU registers for storing intermediate dataDepends on the JVM object modelGenerates an optimized physical query plan from the logical query plan4.Question 4While adhering to best practices, in what order does a typical data engineer perform operations on Apache Spark?1 pointRead, analyze, transform, load, and write Analyze, read, transform, load, and writeRead, analyze, load, transform, and writeAnalyze, read, load, transform, and write5.Question 5Which data sources can be utilized with Apache Spark SQL?1 pointMongoDBCustom file formatsExternal APIsParquet files6.Question 6How can you create a Global Temporary view in Spark SQL?1 pointUse the createTempView function with a “Global” prefixUse the createView function with a “Global” prefix Use the createGlobalTempView function Use the createGlobalView function 7.Question 7Spark creates a Directed Acyclic Graph (DAG) during the creation of a Resilient Distributed Dataset (RDD).The DAG is associated with the new RDD.If there is an action, the driver program, which invokes calls the action, evaluates the DAG after Spark completes the action.The pointer responsible for transforming the RDD returns to the Spark driver program.Spark utilizes the DAG Scheduler to perform a transformation and updates the DAG accordingly.Select the sequence that explains RDD transformation and actions.1 point1 >> 3 >> 4 >> 5 >> 21 >> 2 >> 3 >> 4 >> 51 >> 5 >> 2 >> 4 >> 32 >> 3 >> 4 >> 1 >> 5
Solution
-
The correct statements about a directed acyclic graph (DAG) are: "In Apache Spark, RDDs are represented by the vertices" and "A data structure with edges and vertices".
-
The function applied to create a data set from a sequence is "toDS()".
-
A feature of Tungsten is that it "Generates an optimized physical query plan from the logical query plan".
-
A typical data engineer performs operations on Apache Spark in the following order: "Read, analyze, transform, load, and write".
-
The data sources that can be utilized with Apache Spark SQL are: "MongoDB", "Custom file formats", "External APIs", and "Parquet files".
-
You can create a Global Temporary view in Spark SQL using the "createGlobalTempView function".
-
The sequence that explains RDD transformation and actions is: "1 >> 2 >> 3 >> 4 >> 5".
Similar Questions
What is a directed acyclic graph (DAG)?Select one:a.A graph in which the edges have a direction and there are no cyclesb.A graph in which the edges do not have a direction and there are cyclesc.A graph in which the edges do not have a direction and there are no cyclesd.A graph in which the edges have a direction and there are cycles
What is an example of a real-world application of directed acyclic graphs (DAGs)?Select one:a.Social media networksb.Data pipelinesc.Computer networksd.All of the above
What is the technical term for a node in a directed acyclic graph (DAG)?Select one:a.Cycleb.Vertexc.Graphd.Edge
Consider the dag with set of nodes V = {V1, V2, V3, V4, V5, V6, V7} and set of edges {(V1,V2), (V1,V3), (V1,V4), (V2,V5), (V3,V5), (V3,V6), (V4,V6), (V5,V7), (V6,V7)}. Which of the following node sequences are topologically sorted? Group of answer choices V1, V3, V4, V6, V2, V5, V7 V1, V3, V2, V6, V4, V5, V7 V1, V3, V4, V5, V2, V6, V7 V1, V2, V5, V3, V4, V6, V7 V1, V4, V3, V2, V6, V5, V7
.Question 8Which of the following are actual data structures? 1 pointB TreesC TreesA Trees
Upgrade your grade with Knowee
Get personalized homework help. Review tough concepts in more detail, or go deeper into your topic by exploring other relevant questions.