What kind of Snowflake integration is required when defining an external function in Snowflake?
Answer : A
A Data Engineer is writing a Python script using the Snowflake Connector for Python. The Engineer will use the snowflake.connector.connect function to connect to Snowflake.
The requirements are:
Raise an exception if the specified database, schema, or warehouse does not exist
Improve download performance -
Which parameters of the connect function should be used? (Choose two.)
Answer : CE
A Data Engineer wants to centralize grant management to maximize security. A user needs OWNERSHIP on a table in a new schema. However, this user should not have the ability to make grant decisions.
What is the correct way to do this?
Answer : D
A CSV file, around 1 TB in size, is generated daily on an on-premise server. A corresponding table, internal stage, and file format have already been created in Snowflake to facilitate the data loading process.
How can the process of bringing the CSV file into Snowflake be automated using the LEAST amount of operational overhead?
Answer : B
What are characteristics of Snowpark Python packages? (Choose three.)
Answer : AEF
While running an external function, the following error message is received:
Error: Function received the wrong number of rows
What is causing this to occur?
Answer : D
A Data Engineer enables a result cache at the session level with the following command:
ALTER SESSION SET USE_CACHED_RESULT = TRUE;
The Engineer then runs the following SELECT query twice without delay:
SELECT *
FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
SAMPLE(10) SEED (99);
The underlying table does not change between executions.
What are the results of both runs?
Answer : B
A company built a sales reporting system with Python, connecting to Snowflake using the Python Connector. Based on the user's selections, the system generates the SQL queries needed to fetch the data for the report. First it gets the customers that meet the given query parameters (on average 1000 customer records for each report run), and then it loops the customer records sequentially. Inside that loop it runs the generated SQL clause for the current customer to get the detailed data for that customer number from the sales data table.
When the Data Engineer tested the individual SQL clauses, they were fast enough (1 second to get the customers, 0.5 second to get the sales data for one customer), but the total runtime of the report is too long.
How can this situation be improved?
Answer : D
A company is using Snowpipe to bring in millions of rows every day of Change Data Capture (CDC) into a Snowflake staging table on a real-time basis. The CDC needs to get processed and combined with other data in Snowflake and land in a final table as part of the full data pipeline.
How can a Data Engineer MOST efficiently process the incoming CDC on an ongoing basis?
Answer : A
A Data Engineer is building a pipeline to transform a 1 TB table by joining it with supplemental tables. The Engineer is applying filters and several aggregations leveraging Common Table Expressions (CTEs) using a size Medium virtual warehouse in a single query in Snowflake.
After checking the Query Profile, what is the recommended approach to MAXIMIZE performance of this query if the Profile shows data spillage?
Answer : B
Which system role is recommended for a custom role hierarchy to be ultimately assigned to?
Answer : C
Which callback function is required within a JavaScript User-Defined Function (UDF) for it to execute successfully?
Answer : B
Which Snowflake feature facilitates access to external API services such as geocoders, data transformation, machine learning models, and other custom code?
Answer : C
A Data Engineer needs to know the details regarding the micro-partition layout for a table named Invoice using a built-in function.
Which query will provide this information?
Answer : A