
move process_message inside BaseLLM #1021

Closed
wants to merge 5 commits into main from fix/gemini_keys

Conversation

@geohotstan commented Mar 17, 2024

Features

Feature Docs

Influence

Result

  • This PR fixes this specific error (the sketch below illustrates the mismatch behind it): KeyError: "Could not recognize the intended type of the dict. A Content should have a 'parts' key. A Part should have a 'inline_data' or a 'text' key. A Blob should have 'mime_type' and 'data' keys. Got keys: ['role', 'content']"
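The error occurs because the Gemini SDK expects Content dicts with "role" and "parts" keys, while BaseLLM was passing it OpenAI-style {"role", "content"} dicts. A minimal sketch of the kind of conversion needed (to_gemini_messages is a hypothetical name for illustration, not the PR's actual code):

# Hypothetical helper for illustration; the real fix lives in
# process_message and the Gemini provider, not in a function of this name.
def to_gemini_messages(messages: list[dict]) -> list[dict]:
    """Convert OpenAI-style {"role", "content"} dicts into Gemini Content dicts."""
    converted = []
    for msg in messages:
        # Gemini calls the responding role "model", not "assistant"
        role = "model" if msg["role"] == "assistant" else "user"
        converted.append({"role": role, "parts": [msg["content"]]})
    return converted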

Other

  • There seem to be more bugs related to Gemini usage downstream. I'll try to find more of them later, since I'm also using Gemini :D

@geohotstan changed the title from Fix/gemini keys to move process_message inside BaseLLM on Mar 17, 2024
@codecov-commenter commented Mar 17, 2024

Codecov Report

Attention: Patch coverage is 60.00000%, with 10 lines in your changes missing coverage. Please review.

Project coverage is 82.09%. Comparing base (e40fc66) to head (f332531).

Files                                   Patch %   Lines
metagpt/provider/google_gemini_api.py    28.57%   5 Missing ⚠️
metagpt/provider/base_llm.py             71.42%   4 Missing ⚠️
metagpt/provider/openai_api.py            0.00%   1 Missing ⚠️


Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1021      +/-   ##
==========================================
+ Coverage   81.85%   82.09%   +0.24%     
==========================================
  Files         246      246              
  Lines       13725    13732       +7     
==========================================
+ Hits        11234    11273      +39     
+ Misses       2491     2459      -32     


@geohotstan marked this pull request as ready for review on March 18, 2024 06:43
@garylin2099 (Collaborator)

A good point. Also, @better629, please review the Gemini part of the code.

@garylin2099 requested a review from better629 on March 19, 2024 02:23
@garylin2099 (Collaborator)

Pre-commit checks failed, suggesting formatting issues. Please check https://docs.deepwisdom.ai/main/en/guide/contribute/contribute_guide.html#before-submission for how to format the code.

@better629 (Collaborator)

@geohotstan
DI supports GPT so far and needs further checking with Gemini. Note that process_message is also used in https://github.com/geekan/MetaGPT/blob/main/metagpt/actions/di/write_analysis_code.py.

So, let's do it step by step:

  1. Can you run examples/llm_hello_world.py with Gemini successfully? This checks whether the Gemini provider itself has a problem (a minimal check is sketched after this list).
  2. The original process_message is used in the OpenAI provider under aask_code, which is not used with Gemini.
  3. Digging deeper, can you check whether Gemini supports tools the way OpenAI does, so that we can support DI's full functionality with Gemini?
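A rough version of the step-1 check, assuming the standard MetaGPT entry point (examples/llm_hello_world.py exercises much the same path; the prompt text is arbitrary):

import asyncio

from metagpt.llm import LLM


async def main():
    llm = LLM()  # resolves to the configured provider, e.g. Gemini
    # if this round-trips without the KeyError above, the provider itself is fine
    print(await llm.aask("hello world"))


asyncio.run(main())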

@geohotstan marked this pull request as draft on March 19, 2024 13:09
@iorisa (Collaborator) commented Mar 20, 2024

process_message has been moved to the provider folder and renamed to format_msg (fixbug #1058).

Your approach to the modification is correct.
However, BaseLLM already has a set of functions for formatting messages, so I renamed process_message to format_msg to match the existing function names:

from abc import ABC
from typing import Optional, Union


class BaseLLM(ABC):
    ...

    def _user_msg(self, msg: str, images: Optional[Union[str, list[str]]] = None) -> dict[str, Union[str, dict]]:
        if images:
            # e.g. gpt-4v: chat with images
            return self._user_msg_with_imgs(msg, images)
        else:
            return {"role": "user", "content": msg}

    def _user_msg_with_imgs(self, msg: str, images: Optional[Union[str, list[str]]]):
        """
        images: can be a list of http(s) URLs or base64 strings
        """
        if isinstance(images, str):
            images = [images]
        content = [{"type": "text", "text": msg}]
        for image in images:
            # image url or image base64
            url = image if image.startswith("http") else f"data:image/jpeg;base64,{image}"
            # multiple-image inputs are supported
            content.append({"type": "image_url", "image_url": url})
        return {"role": "user", "content": content}

    def _assistant_msg(self, msg: str) -> dict[str, str]:
        return {"role": "assistant", "content": msg}

    def _system_msg(self, msg: str) -> dict[str, str]:
        return {"role": "system", "content": msg}

    def format_msg(self, messages: Union[str, "Message", list[dict], list["Message"], list[str]]) -> list[dict]:
        """Convert messages to list[dict]."""
        from metagpt.schema import Message  # local import to avoid a circular import

        if not isinstance(messages, list):
            messages = [messages]

        processed_messages = []
        for msg in messages:
            if isinstance(msg, str):
                processed_messages.append({"role": "user", "content": msg})
            elif isinstance(msg, dict):
                assert set(msg.keys()) == {"role", "content"}
                processed_messages.append(msg)
            elif isinstance(msg, Message):
                processed_messages.append(msg.to_dict())
            else:
                raise ValueError(
                    f"Only supported message types are: str, Message, dict, but got {type(msg).__name__}!"
                )
        return processed_messages

    def _system_msgs(self, msgs: list[str]) -> list[dict[str, str]]:
        return [self._system_msg(msg) for msg in msgs]

    def _default_system_msg(self):
        return self._system_msg(self.system_prompt)
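For reference, a quick usage sketch of format_msg (SomeLLM is a placeholder for any concrete BaseLLM subclass; the inputs are made up):

from metagpt.schema import Message

llm = SomeLLM()  # placeholder: any concrete BaseLLM subclass
msgs = llm.format_msg([
    "plain string",                             # -> {"role": "user", "content": "plain string"}
    {"role": "system", "content": "be terse"},  # passed through after the key check
    Message(role="assistant", content="hi"),    # -> Message.to_dict()
])
# msgs is now a uniform list[dict] that downstream providers can consume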

@geohotstan (Author)

The other fix (#1058) was merged.

@geohotstan closed this on Mar 21, 2024
@geohotstan deleted the fix/gemini_keys branch on April 18, 2024 18:21
Development

Successfully merging this pull request may close these issues.

Using Gemini as the LLM, running the data_visualization.py example in the Data Interpreter raises an error