Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MultimodalQnA Image and Audio Support Phase 1 #1071

Merged
merged 58 commits into from
Nov 8, 2024

Conversation

mhbuehler
Copy link
Contributor

@mhbuehler mhbuehler commented Nov 5, 2024

Description

This PR adds the following new features as specified in "Phase 1" of this RFC. The related PR in GenAIComps is opea-project/GenAIComps#852 and it will need to be merged before this one in GenAIExamples.

Data prep and ingestion enhancements:

  • Accept image only
  • Accept image and text
  • Accept speech audio only

Other enhancements:

  • Allow the user to choose the embedding model and LVM when starting the services

Note that the planned query enhancement "Accept speech audio only" has been moved to Phase 2 and a PR for that phase will be submitted for the next release.

Issues

MultimodalQnA Image & Audio Support RFC

Type of change

List the type of change like below. Please delete options that are not relevant.

  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

No new dependencies

Tests

Updated the individual microservice's test scripts, the GenAIExamples' MultimodalQnA test scripts, and did manual testing of the UI and documented curl commands.

mhbuehler and others added 30 commits October 14, 2024 16:28
* added entrypoint override

Signed-off-by: okhleif-IL <[email protected]>

* fixed entrypoint container

Signed-off-by: okhleif-IL <[email protected]>

* removed workdir

Signed-off-by: okhleif-IL <[email protected]>

* reverted set env vars changes

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint for embedding model

Signed-off-by: okhleif-IL <[email protected]>

* added llava model to readme

Signed-off-by: okhleif-IL <[email protected]>

* removed stray \n

Signed-off-by: okhleif-IL <[email protected]>

* added entrypoint override in bridgetower hpu

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
* changed variable names

Signed-off-by: okhleif-IL <[email protected]>

* changed gaudi env vars

Signed-off-by: okhleif-IL <[email protected]>

* changed vars + docs

Signed-off-by: okhleif-IL <[email protected]>

* verified + updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated readme

Signed-off-by: okhleif-IL <[email protected]>

* updated tests

Signed-off-by: okhleif-IL <[email protected]>

---------

Signed-off-by: okhleif-IL <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Signed-off-by: Melanie Buehler <[email protected]>
Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @mhbuehler @dmsuehir and @okhleif-IL for this well put together PR!

LGTM!

mhbuehler and others added 5 commits November 6, 2024 09:00
Updated main README and supported_examples with MMQA details
Fix for lvm-dependent tests and clear textbox
Signed-off-by: okhleif-IL <[email protected]>
Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Putting this PR on hold until opea-project/GenAIComps#852 is merged.

MultimodalQnA/tests/test_compose_on_gaudi.sh Outdated Show resolved Hide resolved
MultimodalQnA/tests/test_compose_on_xeon.sh Outdated Show resolved Hide resolved
@lvliang-intel lvliang-intel requested a review from Spycsh November 8, 2024 01:53
@Spycsh
Copy link
Member

Spycsh commented Nov 8, 2024

opea-project/GenAIComps#852 is merged now, please resolve comments from @ashahba .

Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@ashahba
Copy link
Collaborator

ashahba commented Nov 8, 2024

@tileintel this one is ready to be merged.
All I need is your 👍
Thanks.

@joshuayao joshuayao linked an issue Nov 8, 2024 that may be closed by this pull request
@ftian1 ftian1 merged commit bbc95bb into opea-project:main Nov 8, 2024
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Image and Audio Support for MultimodalityQnA
6 participants