diff --git a/self-paced-labs/data-science-spark/spark_data_science.ipynb b/self-paced-labs/data-science-spark/spark_data_science.ipynb
index 1408f94224..8b6313766d 100644
--- a/self-paced-labs/data-science-spark/spark_data_science.ipynb
+++ b/self-paced-labs/data-science-spark/spark_data_science.ipynb
@@ -94,7 +94,7 @@
  "id": "qm7aYYPI04tE"
  },
  "source": [
- "To use [Google Cloud Serverless for Apache Spark](https://docs.cloud.google.com/dataproc-serverless/docs), turn on [Private Google Access](https://docs.cloud.google.com/vpc/docs/configure-private-google-access#gcloud_1)."
+ "To use [Managed Service for Apache Spark](https://docs.cloud.google.com/dataproc-serverless/docs) serverless, turn on [Private Google Access](https://docs.cloud.google.com/vpc/docs/configure-private-google-access#gcloud_1)."
  ]
  },
  {
@@ -236,7 +236,7 @@
  "id": "KuwqTKG0vXE3"
  },
  "source": [
- "For large workloads, you can use [Lightning Engine](https://cloud.google.com/products/lightning-engine) to further accelerate your Spark workloads by up to [4.3x](https://docs.cloud.google.com/dataproc-serverless/docs/guides/lightning-engine#enable_lightning_engine). The dataset in this lab is too small to see meaningful differences. For reference, you could use the following code with your own notebook and data:\n",
+ "For large workloads, you can use [Lightning Engine](https://cloud.google.com/products/lightning-engine) to further accelerate your Spark workloads by up to [4.9x](https://docs.cloud.google.com/dataproc-serverless/docs/guides/lightning-engine#enable_lightning_engine). The dataset in this lab is too small to see meaningful differences. For reference, you could use the following code with your own notebook and data:\n",
  "\n",
  "```\n",
  "session.runtime_config.properties = {\n",
@@ -479,7 +479,7 @@
  "id": "bTgsXlzq04yR"
  },
  "source": [
- "**Prompt 5**: Using the table \"users\", generate code to plot country vs traffic source using a suitable plotting library."
+ "**Prompt 5**: Using the table \"users\", generate code to plot country vs traffic source using a suitable plotting library. Import all necessary libraries."
  ]
  },
  {
@@ -497,7 +497,7 @@
  "id": "5uKeAC2Y09kM"
  },
  "source": [
- "**Prompt 6:** Create distribution plots for \"age\", \"country\", \"gender\", and \"traffic_source\". Use appropriate plots for categorical columns."
+ "**Prompt 6:** Create distribution plots for \"age\", \"country\", \"gender\", and \"traffic_source\". Use appropriate plots for categorical columns. Import all necessary libraries."
  ]
  },
  {