Commit 69b2530

Final updates to the tutorial.
1 parent 067a14a commit 69b2530

6 files changed

+28
-1
lines changed


nvidia-nemo-oke/readme.md

Lines changed: 28 additions & 1 deletion
@@ -497,13 +497,40 @@
497497
498498
## Task 5: Working with sample reranking and embedding notebooks
499499
500-
1.
500+
1. Locate the two sample notebooks, [reranking](sample-notebooks/reranking_23ai_clean.ipynb) and [text embedding](sample-notebooks/text_embedding_23ai_clean.ipynb), in the **`sample-notebooks`** directory and download them to your computer.
501501
502+
2. Upload the notebooks to JupyterHub.
502503
504+
![Upload files to Jupyter](images/jupyter-upload-files.png)
503505
506+
3. Start with the **text embedding** notebook. Paste your NGC API key into the second cell, then run cells 1 and 2 to import the libraries and validate the key.
504507
508+
![Input your API key](images/jupyter-input-api-key.png)
505509
510+
4. Next, locate the 10th cell and enter the database connection information you gathered and tested earlier.
506511
512+
![DB connection setup](images/jupyter-db-connection-setup.png)
513+
514+
5. After updating the fields, run this cell to confirm DB connectivity. A successful connection should look like this:
515+
516+
![DB connection success](images/jupyter-db-connection-success.png)
517+
518+
6. Now head back up to the top of the notebook and run all cells.
519+
520+
![Text Embedding notebook](images/jupyter-run-all-cells.png)
521+
522+
7. As you scroll through the notebook, you should see several different questions. Note cell 9, which asks about the NVIDIA H200. At this point, the LLM has no data on this product. Cell 15 performs text embedding on the product page for the H200. After the entire notebook completes, you'll see in cell 27 that RAG is able to answer the question about the H200.
523+
524+
8. Move on to the **reranking** notebook and repeat steps 3-5 above, then run the entire notebook. Reranking reorders the retrieved data by relevance to the question, yielding faster and more accurate results with fewer hallucinations.
525+
526+
9. You've now completed the tutorial on deploying NVIDIA NeMo microservices to Oracle Kubernetes Engine (OKE). If you'd like to experiment, you can upload different PDF files to see how embedding and reranking behave with additional data.
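
The connectivity check in steps 4 and 5 can be sketched roughly as follows. This is an illustration, not the notebook's actual cell: it assumes the `python-oracledb` driver is installed, and the helper names `build_dsn` and `check_connection`, along with the placeholder host, port, and service name, are hypothetical.

```python
def build_dsn(host: str, port: int, service_name: str) -> str:
    """Build an Easy Connect string of the form host:port/service_name."""
    return f"{host}:{port}/{service_name}"


def check_connection(user: str, password: str, dsn: str) -> str:
    """Open a connection, run a trivial query, and return the database version.

    Assumes the python-oracledb package is available (pip install oracledb).
    """
    import oracledb  # imported lazily so build_dsn works without the driver

    with oracledb.connect(user=user, password=password, dsn=dsn) as connection:
        with connection.cursor() as cursor:
            # A query against DUAL is a cheap way to prove the session works.
            cursor.execute("select 1 from dual")
            cursor.fetchone()
        return connection.version


# Hypothetical placeholder values; substitute the details you gathered earlier.
dsn = build_dsn("db.example.com", 1521, "mypdb.example.com")
```

Calling `check_connection("my_user", "my_password", dsn)` against a reachable database would return its version string; if the details are wrong, the driver raises an error instead.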
527+
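
To make the difference between steps 7 and 8 concrete, here is a minimal, self-contained sketch of retrieve-then-rerank. It does not use the NVIDIA embedding or reranking models from the notebooks: `embed` is a toy bag-of-words stand-in for an embedding model, `rerank` is a toy term-coverage stand-in for a reranking model, and the passages are made up for illustration.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query: str, passages: list[str], k: int) -> list[str]:
    """First stage: return the k passages most similar to the query."""
    q = embed(query)
    return sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)[:k]


def rerank(query: str, candidates: list[str]) -> list[str]:
    """Second stage (toy): order candidates by the share of query terms they cover."""
    terms = set(query.lower().split())

    def coverage(passage: str) -> float:
        return len(terms & set(passage.lower().split())) / len(terms)

    return sorted(candidates, key=coverage, reverse=True)


passages = [
    "the nvidia h200 tensor core gpu ships with 141 gb of hbm3e memory",
    "kubernetes schedules pods onto worker nodes",
    "gpu memory versus host memory",
]
query = "how much memory does the h200 gpu have"

candidates = retrieve(query, passages, k=2)
reranked = rerank(query, candidates)
```

With this toy data, the short generic passage about GPU memory ranks first after retrieval, and the H200 passage moves to the top after reranking. A real reranking model scores query and passage pairs far more carefully, but the two-stage shape is the same.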
528+
529+
## Acknowledgements
530+
531+
* **Author(s)** - Eli Schilling - Technical Architect, Sadra Fardhosseini - Data Scientist
532+
* **Contributors** -
533+
* **Last Updated By/Date** - October, 2024
507534
508535
509536
