CIKM25-MME-SID

Code for paper titled 'Empowering Large Language Model for Sequential Recommendation via Multimodal Embeddings and Semantic IDs' accepted to CIKM 2025 Full Research track. Tesla V100 or A100 GPUs are preferred. In the following, we take Amazon Beauty dataset as an example.

Code Running

Preparation

First, download and implement Llama3-8B-instruct and LLM2CLIP. We use the environment with python 3.9.19 + torch 2.0.1, and install thuml/Transfer-Learning-Library and LLM2Vec by running the following code.

cd Transfer-Learning-Library-master
pip install -r requirements.txt
python setup.py install
cd llm2vec
pip install -e .

Stage 1: Encoding

We save the pre-trained collaborative, textual, and visual embedding into SASRec_item_embed_new.pkl, Beauty_llm2clip_text_emb.pt, and Beauty_llm2clip_pic_emb.pt under dataset folder. Then the MM-RQ-VAE is trained to generate semantic IDs and codebook embeddings.

python ./dataset/pic_text_embedding_generate.py #to generate Beauty_llm2clip_text_emb.pt and Beauty_llm2clip_pic_emb.pt
sh ./MM-RQ-VAE/train_tokenizer_MM.sh
sh ./MM-RQ-VAE/tokenize_MM.sh

Note: To generate Beauty_llm2clip_text_emb.pt and Beauty_llm2clip_pic_emb.pt, run dataset/pic_text_embedding_generate.py

Stage 2: Fine-tuning

We fine-tune the Llama3-8B-instruct to conduct sequential recommendation task by running the following code. After training, the evaluation result on the test set will be shown.

python MME-SID.py

It takes about 110 mins to train an epoch on 8 A100 GPUs and 160 mins on 8 V100 GPUs.

Prompt Template

We provide the prompt template as the input of Llama3-8B-instruct for sequential recommendation task. Specifically, the angle brackets denote the special tokens of Llama3 while the content in curly brackets, i.e., the behavioral items of each user needs to be filled.

<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n You are a helpful AI assistant for recommendations<|eot_id|><|start_header_id|>user <|end_header_id|>\n\n Given the user's purchase history, predict next possible item to be purchased. {Behavioral Item Sequence} <|eot_id|><|start_header_id|>assistant<|end_header_id|>

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
MM-RQ-VAE		MM-RQ-VAE
dataset		dataset
utils		utils
LICENSE		LICENSE
MME-SID.py		MME-SID.py
README.md		README.md
finetune.py		finetune.py
inference.py		inference.py
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CIKM25-MME-SID

Code Running

Prompt Template

About

Uh oh!

Releases

Packages

Languages

License

Applied-Machine-Learning-Lab/MME-SID

Folders and files

Latest commit

History

Repository files navigation

CIKM25-MME-SID

Code Running

Prompt Template

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages