{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "83c16ef6-98e7-48d0-b82f-4029a730ff00",
   "metadata": {},
   "source": [
| 8 | + "[](https://colab.research.google.com/github/openlayer-ai/examples-gallery/blob/main/monitoring/llms/rag-tracing/rag_tracer.ipynb)\n", |
| 9 | + "\n", |
| 10 | + "\n", |
| 11 | + "# <a id=\"top\">Tracing a RAG system</a>" |
| 12 | + ] |
| 13 | + }, |
| 14 | + { |
| 15 | + "cell_type": "code", |
| 16 | + "execution_count": null, |
| 17 | + "id": "21137554-ad8e-444b-bf2e-49393f072956", |
| 18 | + "metadata": {}, |
| 19 | + "outputs": [], |
| 20 | + "source": [ |
| 21 | + "import os\n", |
| 22 | + "import openai\n", |
| 23 | + "\n", |
| 24 | + "# OpenAI env variable\n", |
| 25 | + "os.environ[\"OPENAI_API_KEY\"] = \"YOUR_OPENAI_KEY_HERE\"\n", |
| 26 | + "\n", |
| 27 | + "# Openlayer env variables\n", |
| 28 | + "os.environ[\"OPENLAYER_API_KEY\"] = \"YOUR_OPENLAYER_API_KEY_HERE\"\n", |
| 29 | + "os.environ[\"OPENLAYER_PROJECT_NAME\"] = \"YOUR_OPENLAYER_PROJECT_NAME_HERE\" # Where the traces will be uploaded to" |
| 30 | + ] |
| 31 | + }, |
  {
   "cell_type": "markdown",
   "id": "20b25a1f-529e-45c5-90e5-26485914f511",
   "metadata": {},
   "source": [
    "## Defining and decorating our RAG system"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "9e2f8d80-d49a-48f0-8c12-350045dff985",
   "metadata": {},
   "outputs": [],
   "source": [
    "%%bash\n",
    "\n",
    "if [ ! -e \"context.txt\" ]; then\n",
    "    curl \"https://raw.githubusercontent.com/openlayer-ai/examples-gallery/main/monitoring/llms/rag-tracing/context.txt\" --output \"context.txt\"\n",
    "fi"
   ]
  },
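  {
   "cell_type": "markdown",
   "id": "1b7c9a2e-4f3d-4a8b-9c5e-0d6f2a814e37",
   "metadata": {},
   "source": [
    "Before defining the pipeline, it can help to peek at the context file we just downloaded. This quick check is an illustration added for this walkthrough (not part of the traced pipeline): the `RagPipeline` below splits the file into sections on blank lines, so we count the sections and print the first one."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "5e2d8c41-7a9b-4f06-8d3c-92b1e4a6c750",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Illustrative check: the pipeline below splits the context file\n",
    "# into sections on blank lines (\"\\n\\n\").\n",
    "with open(\"context.txt\", \"r\", encoding=\"utf-8\") as f:\n",
    "    sections = f.read().split(\"\\n\\n\")\n",
    "\n",
    "print(f\"Number of context sections: {len(sections)}\")\n",
    "print(f\"First section:\\n{sections[0]}\")"
   ]
  },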
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "60d470d7-3aa0-4703-a9e7-cab24325a4a5",
   "metadata": {},
   "outputs": [],
   "source": [
    "import numpy as np\n",
    "from openai import OpenAI\n",
    "from sklearn.feature_extraction.text import TfidfVectorizer\n",
    "from sklearn.metrics.pairwise import cosine_similarity\n",
    "\n",
    "from openlayer import llm_monitors\n",
    "from openlayer.tracing import tracer"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "c8070d3f-ebec-4faf-8959-23e6ac22737d",
   "metadata": {},
   "outputs": [],
   "source": [
    "class RagPipeline:\n",
    "    def __init__(self, context_path: str):\n",
    "        # Wrap the OpenAI client with Openlayer's OpenAIMonitor to trace its calls\n",
    "        self.openai_client = OpenAI()\n",
    "        llm_monitors.OpenAIMonitor(client=self.openai_client)\n",
    "\n",
    "        # Build a TF-IDF index over the context sections (split on blank lines)\n",
    "        self.vectorizer = TfidfVectorizer()\n",
    "        with open(context_path, \"r\", encoding=\"utf-8\") as file:\n",
    "            self.context_sections = file.read().split(\"\\n\\n\")\n",
    "        self.tfidf_matrix = self.vectorizer.fit_transform(self.context_sections)\n",
    "\n",
    "    # Decorate the functions you'd like to trace with @tracer.trace()\n",
    "    @tracer.trace()\n",
    "    def query(self, user_query: str) -> str:\n",
    "        \"\"\"Main method.\n",
    "\n",
    "        Answers a user query with the LLM.\n",
    "        \"\"\"\n",
    "        context = self.retrieve_context(user_query)\n",
    "        prompt = self.inject_prompt(user_query, context)\n",
    "        answer = self.generate_answer_with_gpt(prompt)\n",
    "        return answer\n",
    "\n",
    "    @tracer.trace()\n",
    "    def retrieve_context(self, query: str) -> str:\n",
    "        \"\"\"Context retriever.\n",
    "\n",
    "        Given the query, returns the most similar context section (using TF-IDF).\n",
    "        \"\"\"\n",
    "        query_vector = self.vectorizer.transform([query])\n",
    "        cosine_similarities = cosine_similarity(query_vector, self.tfidf_matrix).flatten()\n",
    "        most_relevant_idx = np.argmax(cosine_similarities)\n",
    "        return self.context_sections[most_relevant_idx]\n",
    "\n",
    "    @tracer.trace()\n",
    "    def inject_prompt(self, query: str, context: str) -> list:\n",
    "        \"\"\"Combines the query with the retrieved context and returns\n",
    "        the prompt (formatted for OpenAI chat models).\"\"\"\n",
    "        return [\n",
    "            {\"role\": \"system\", \"content\": \"You are a helpful assistant.\"},\n",
    "            {\"role\": \"user\", \"content\": f\"Answer the user query using only the following context: {context}. \\nUser query: {query}\"},\n",
    "        ]\n",
    "\n",
    "    @tracer.trace()\n",
    "    def generate_answer_with_gpt(self, prompt) -> str:\n",
    "        \"\"\"Forwards the prompt to GPT and returns the answer.\"\"\"\n",
    "        response = self.openai_client.chat.completions.create(\n",
    "            messages=prompt,\n",
    "            model=\"gpt-3.5-turbo\",\n",
    "        )\n",
    "        return response.choices[0].message.content.strip()"
   ]
  },
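  {
   "cell_type": "markdown",
   "id": "2c4e6a8b-1d3f-4b5a-8e7c-9f0a1b2c3d4e",
   "metadata": {},
   "source": [
    "To build intuition for the retrieval step, here is a minimal, self-contained sketch of the same TF-IDF idea that `retrieve_context` uses (an illustration with made-up toy sections, not part of the traced pipeline): vectorize the sections, then pick the one most similar to the query by cosine similarity. It reuses the imports from above."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "8a0b2c4d-6e1f-4a3b-9c5d-7e9f1a3b5c6d",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Standalone sketch of TF-IDF retrieval, mirroring RagPipeline.retrieve_context.\n",
    "# The toy sections are made up for illustration only.\n",
    "toy_sections = [\n",
    "    \"Apple was founded by Steve Jobs, Steve Wozniak, and Ronald Wayne.\",\n",
    "    \"The Apple II was one of the first highly successful personal computers.\",\n",
    "]\n",
    "toy_vectorizer = TfidfVectorizer()\n",
    "toy_matrix = toy_vectorizer.fit_transform(toy_sections)\n",
    "\n",
    "query_vector = toy_vectorizer.transform([\"Who founded Apple?\"])\n",
    "similarities = cosine_similarity(query_vector, toy_matrix).flatten()\n",
    "print(similarities)  # one cosine-similarity score per section\n",
    "print(toy_sections[np.argmax(similarities)])  # the most similar section wins"
   ]
  },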
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f96f7073-7be4-4254-a6c9-eb808312beb8",
   "metadata": {},
   "outputs": [],
   "source": [
    "rag = RagPipeline(\"context.txt\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "50e046fd-68f1-4f66-b2a1-03aa95b9b367",
   "metadata": {},
   "outputs": [],
   "source": [
    "rag.query(\"Who were the founders of Apple?\")"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "afc7f963-fc13-4e93-b3ef-98aa183770a3",
   "metadata": {},
   "outputs": [],
   "source": [
    "rag.query(\"When did Apple IPO?\")"
   ]
  },
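  {
   "cell_type": "markdown",
   "id": "4f6a8c0d-2b3e-4c5f-9a1b-3d5e7f9a0b1c",
   "metadata": {},
   "source": [
    "You can also exercise the traced steps individually. For example, the call below (added here as an illustration) shows which context section the TF-IDF retriever picks for the last query; since `retrieve_context` is decorated with `@tracer.trace()`, a standalone call like this should be recorded as a trace of its own as well."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "6b8d0e2f-4a5c-4d7e-8b9a-1c3d5e7f9a2b",
   "metadata": {},
   "outputs": [],
   "source": [
    "# Inspect the retrieval step on its own: which context section\n",
    "# does the TF-IDF retriever return for this query?\n",
    "rag.retrieve_context(\"When did Apple IPO?\")"
   ]
  },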
  {
   "cell_type": "markdown",
   "id": "42f1e832-4c3f-4a6a-8013-8607ff141f67",
   "metadata": {},
   "source": [
    "That's it! After each inference, the trace is uploaded to Openlayer. If you navigate to your project, you should see the traces for these queries to our RAG system."
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.9.18"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}