-
-
Notifications
You must be signed in to change notification settings - Fork 160
[OPENAI] Support image edits with gpt-image-1 #152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
issue #138 |
…ith-image * 'main' of github.com:crmne/ruby_llm: (24 commits) Enhance Rails guide with detailed persistence flow explanation and setup instructions Remove work-in-progress warning from models documentation generation Add validation considerations for Message model and update persistence flow documentation Add note about upcoming OpenAI headers support in v1.3.0 Handle OpenAI organization and project IDs (crmne#162) Refactor acts_as_message and acts_as_tool_call methods to improve parameter handling and default values Remove reasoning section from available models documentation and rake task Remove debug logging for pricing in OpenRouter models Updated models page Fixed pricing parsing for OpenRouter Updated models Add warning about work in progress for Parsera integration in available models documentation Major refactoring of ModelInfo and Parsera API support for listing LLM capabilities and pricing. Fix inflector (crmne#159) Use foreign_key instead of to_s for acts_as methods (crmne#157) Fixes #embed fails when using default embedding model Add support for logging to file via configuration (crmne#148) Updated acts_as_* helpers to use canonical 'rails-style' foreign keys (crmne#151) refactor(media): streamline content formatting methods across providers Fixed Calling `chat.to_llm` keeps appending messages to the message array ...
would be great to have your feedback @crmne I am a bit struggling with capabilities for |
@crmne would be great to not ghost PRs. at least pin out what could be done better if you don't agree |
Thanks for the work on this. Image editing is definitely in scope, but I'd prefer extending the existing # Generate from scratch (current behavior)
RubyLLM.paint("a sunset over mountains")
# Edit existing image (new behavior)
RubyLLM.paint("make it more vibrant", with: "path/to/image.png") This keeps the API consistent with how On "ghosting": I respond when I can. This is unpaid work I do between running my business and other priorities. Characterizing my delayed responses as "ghosting" is inappropriate and creates a toxic environment. I'll review this properly when I have time. |
@crmne thats the feeling I had, my bad if you were hurt by "ghosting". As you know we also contribute (for free), when a PR doesn't get merged and we keep having conflicts as we try to catch up main branch which is frustrating. Regarding the PR I wanted to use Thanks for maintaining this gem |
I appreciate the apology. The merge conflict frustration is understandable. I still prefer extending The API should be: RubyLLM.paint("prompt") # generate
RubyLLM.paint("prompt", with: "path") # edit This matches how |
Still a draft but I use it in production in my app https://github.com/sbounmy/hongbao_bitcoin
Usage
Todo :
gpt-image-1
when I ran the rake task so I added manually)