|
497 | 497 |
|
498 | 498 | ## Task 5: Working with sample reranking and embedding notebooks
|
499 | 499 |
|
500 |
| -1. |
| 500 | +1. Locate the two sample notebooks: [reranking](sample-notebooks/reranking_23ai_clean.ipynb) and [text embedding](sample-notebooks/text_embedding_23ai_clean.ipynb in the **`sample-notebooks`** directory. Download to your computer. |
501 | 501 |
|
| 502 | +2. Upload the notebooks to JupyterHub. |
502 | 503 |
|
| 504 | +  |
503 | 505 |
|
| 506 | +3. Start with the **text embedding** notebook. In the second cell you'll need to paste your NGC API Key. Run cells 1 and 2 to import libraries and validate your NGC API key. |
504 | 507 |
|
| 508 | +  |
505 | 509 |
|
| 510 | +4. Next, locate the 10th cell where you'll need to input the Database connection information you gathered / tested earlier. |
506 | 511 |
|
| 512 | +  |
| 513 | +
|
| 514 | +5. After updating the fields, run this cell to confirm DB connectivity. A successful connection should look like this: |
| 515 | +
|
| 516 | +  |
| 517 | +
|
| 518 | +6. Now head back up to the top of the notebook and run all cells. |
| 519 | +
|
| 520 | +  |
| 521 | +
|
| 522 | +7. Scrolling through you should see several different questions. Note cell 9 which asks about the NVIDIA H200. At this time, the LLM has no data on this product. Cell 15 performs text embedding on the product page for the H200. You'll then see in Cell 27 after the entire notebook completes, that RAG is able to provide an answer to the question about the H200. |
| 523 | +
|
| 524 | +8. Moving to the **Reranking** notebook, be sure to repeat steps 3-5 above. Then run the entire notebook. Reranking will orient the data in a more efficient manner, yielding faster and more accurate results with a lower degree of hullicinations. |
| 525 | +
|
| 526 | +9. And now, you've completed the tutorial on deploying NVIDIA NeMo microservices to Oracle Kubernetes Engine (OKE). If you'd like to experiment, you can upload different PDF files to see how embedding and reranking will function with additional data. |
| 527 | +
|
| 528 | +
|
| 529 | +## Acknowledgements |
| 530 | +
|
| 531 | +* **Author(s)** - Eli Schilling - Technical Architect, Sadra Fardhosseini - Data Scientist |
| 532 | +* **Contributors** - |
| 533 | +* **Last Updated By/Date** - October, 2024 |
507 | 534 |
|
508 | 535 |
|
509 | 536 |
|
|
0 commit comments