
Stream responses #39

Merged
merged 2 commits into jupyterlite:main from stream
Feb 13, 2025

Conversation

@jtpio (Member) commented Feb 13, 2025

Fixes #36

[video: jupyterlite-ai-stream.mp4]

The tokens do seem to be correctly streamed when looking at the request in the dev tools:

[image: the streamed request shown in the browser dev tools]

For reference, langchain handles "auto streaming": https://js.langchain.com/docs/concepts/streaming/#auto-streaming-chat-models
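
For context, here is a minimal sketch of consuming that streaming API in langchain.js (the ChatOpenAI provider and model name are illustrative assumptions, not necessarily what this extension uses):

import { ChatOpenAI } from '@langchain/openai';

async function streamReply(prompt: string): Promise<string> {
  // Any langchain.js chat model with streaming support exposes stream().
  const model = new ChatOpenAI({ model: 'gpt-4o-mini' });
  let content = '';
  // stream() resolves to an async iterable of message chunks, each
  // carrying the next token(s) of the reply.
  for await (const chunk of await model.stream(prompt)) {
    // chunk.content can also be a complex array for multimodal replies;
    // plain text is assumed here.
    if (typeof chunk.content === 'string') {
      content += chunk.content;
    }
  }
  return content;
}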

@jtpio added the enhancement (New feature or request) label on Feb 13, 2025
  time: Date.now(),
  type: 'msg'
};
this.messageAdded(errorMsg);
jtpio (Member Author):

This seems to be doing the job, although there may be some future optimizations to be done in jupyter-chat.

Currently, jupyter-chat seems to remove the previous message before adding the updated one here:

https://github.com/jupyterlab/jupyter-chat/blob/73512c16a6c602b81f873ecf9a3c20cb73cefd94/packages/jupyter-chat/src/model.ts#L435-L449

Maybe it should also compare the timestamps like the comment suggests?
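
For illustration, a hypothetical version of that comparison (simplified message shape, not jupyter-chat's actual code):

interface IMessage {
  id: string;
  body: string;
  time: number; // epoch milliseconds
}

// Replace an existing message only if the incoming update is not older,
// so a stale update cannot clobber newer content; otherwise append.
function upsertMessage(messages: IMessage[], incoming: IMessage): void {
  const index = messages.findIndex(m => m.id === incoming.id);
  if (index === -1) {
    messages.push(incoming);
  } else if (incoming.time >= messages[index].time) {
    messages[index] = incoming;
  }
}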

brichet (Collaborator):

I think the comment does not refer to the correct line; the message is sent/updated above:

this.messageAdded(botMsg);

In your implementation, the timestamp is not updated, so the position of the message should not change. And it is probably better this way, to avoid confusion.
For comparison, streaming in jupyter-ai leads to update_message in the back-end, which does not update the timestamp.

We should probably homogenize this in jupyter-chat, and allow consumer extensions to choose whether or not to update the timestamp...

jtpio (Member Author):

Yes, right, it was about the line above (which GitHub shows as not modified).

> the timestamp is not updated, so the position of the message should not change. And it is probably better this way, to avoid confusion.

Yes, in terms of UX this seems fine.

jtpio (Member Author):

On another note, for a different issue: after using the jupyter-chat API, I was wondering if changing to the following could make sense:

  • messageAdded -> addMessage
  • messagesDeleted -> deleteMessage

In general when working with extensions, properties ending in -ed tend to refer to signals. It may be more of a personal opinion, but addMessage also sounds more natural than messageAdded when interacting with the chat model.
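
For illustration, the Lumino convention that motivates this (the names and shapes below are hypothetical, not jupyter-chat's actual API):

import { ISignal, Signal } from '@lumino/signaling';

class ChatModel {
  // Imperative name: a method that performs the action...
  addMessage(body: string): void {
    this._messages.push(body);
    this._messageAdded.emit(body);
  }

  // ...past-tense name: a signal that consumers connect to.
  get messageAdded(): ISignal<this, string> {
    return this._messageAdded;
  }

  private _messages: string[] = [];
  private _messageAdded = new Signal<this, string>(this);
}

// Usage: react to messages added elsewhere.
// model.messageAdded.connect((_, body) => console.log(body));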

brichet (Collaborator):

I agree, but in this case it could also be confusing with sendMessage and deleteMessage, which already exist.

This is the distinction between:

  • the actions the user performs in their chat (sendMessage and deleteMessage), which must be propagated to the other users
  • the actions done by remote users (messageAdded and messagesDeleted), which must update the chat view...

In the case of jupyterlite chats, it is more confusing because there is no back-end: sendMessage itself propagates the messages from both the user and the AI.

It may be worth opening an issue in jupyter-chat to discuss the naming.

jtpio (Member Author):

Yeah, happy to open an issue on https://github.com/jupyterlab/jupyter-chat to discuss this 👍

@jtpio (Member Author) commented Feb 13, 2025

Do we want to let users choose between streaming and not streaming responses?

It's probably fine to default to streaming for now if we can, since this is what all AI chats are doing at the moment. But if we want to let users choose, there could be a new option in the settings.
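
If we go that route, a rough sketch of reading such a setting (the plugin id and setting name below are made up for illustration):

import { ISettingRegistry } from '@jupyterlab/settingregistry';

async function shouldStream(registry: ISettingRegistry): Promise<boolean> {
  // '@jupyterlite/ai:plugin' and 'streamResponses' are assumed names.
  const settings = await registry.load('@jupyterlite/ai:plugin');
  // Default to streaming when the setting is not present.
  return (settings.get('streamResponses').composite as boolean) ?? true;
}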

@jtpio marked this pull request as ready for review on February 13, 2025 at 07:58
@jtpio requested a review from brichet on February 13, 2025 at 11:50
// Append each streamed chunk to the message content as it arrives
for await (const chunk of await this._aiProvider.chatModel.stream(
  messages
)) {
  content += chunk.content;
brichet (Collaborator) commented Feb 13, 2025:

I'm wondering if the stream chunks are always additive, or whether they could modify a previous part of the message.
I don't know if there is a setting for that, or whether it is worth looking into that feature in the future.

jtpio (Member Author):

Guessing they should be additive, as otherwise it would be difficult to tell whether that's the case or not?

jtpio (Member Author):

Or do you mean that some providers would return the complete updated message each time?

For example:

  • He
  • Hello
  • Hello,
  • Hello, world

Instead of:

  • He
  • llo
  • ,
  • world

The second one is more efficient in terms of bandwidth and data transferred, so there may not be many advantages to the first approach?
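
The two shapes as tiny reducers, for illustration (a space is added to the last chunk for readability):

// Delta chunks: each chunk carries only the new text, so we append.
const applyDelta = (current: string, chunk: string): string => current + chunk;

// Cumulative chunks: each chunk is the whole message so far, so we replace.
const applyCumulative = (_current: string, chunk: string): string => chunk;

const deltas = ['He', 'llo', ',', ' world'];
console.log(deltas.reduce(applyDelta, '')); // 'Hello, world'

const cumulative = ['He', 'Hello', 'Hello,', 'Hello, world'];
console.log(cumulative.reduce(applyCumulative, '')); // 'Hello, world'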

brichet (Collaborator):

In this case I agree that additive is the best approach.

> Guessing they should be additive, as otherwise it would be difficult to tell whether that's the case or not?

Indeed, it must always act in the same way.

To explain my question: it looks like it's writing word by word, like a human. But sometimes you realize in the middle of a sentence that you could have written it differently for more consistency, and want to reformat the whole sentence. I was wondering if some models could act similarly to humans. But as you said, it would be difficult to know with this API.

brichet (Collaborator) left a review:

Looks great, thanks @jtpio

@brichet (Collaborator) commented Feb 13, 2025

> Do we want to let users choose between streaming and not streaming responses?
>
> It's probably fine to default to streaming for now if we can, since this is what all AI chats are doing at the moment. But if we want to let users choose, there could be a new option in the settings.

We can think about it in a follow-up PR; I agree that having it as the default seems good for now.

@jtpio (Member Author) commented Feb 13, 2025

Thanks @brichet for the review!

Merging to be able to use this change in other PRs. Happy to track questions and improvements separately.

@jtpio merged commit fb52aa8 into jupyterlite:main on Feb 13, 2025
7 checks passed
@jtpio deleted the stream branch on February 13, 2025 at 19:29
Closes #36: Support streaming responses.