Code release for the paper MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning (Findings of ACL 2024). We propose a multi-modal retrieval augmented framework that helps LMs and MLMs generate more sensible (commonsense-compliant) sentences.
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning
Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng
conda create -n more python=3.8
conda activate more
pip install -r requirements.txt
pip install datas/en_core_web_sm-3.0.0-py3-none-any.whl
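To confirm the spaCy model installed from the wheel loads correctly (assuming spaCy itself is pulled in by requirements.txt), you can run:
python -c "import spacy; spacy.load('en_core_web_sm')"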
You can download the data we crawled from this link, or crawl it yourself:
cd ./datas/commongen
#image
python src/grounding_image/crawl_image.py
python src/grounding_image/remove_repeat_image.py
#text
python src/grounding_text/crawl_text.py --n_threads 8
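After crawling, a quick sanity check can confirm that images actually arrived. The sketch below is illustrative only: it assumes the crawler saves image files somewhere under ./datas/commongen, while the exact layout and file names are determined by crawl_image.py.
```python
# Illustrative sanity check, not part of the released code.
# Assumes crawled images are stored as .jpg/.png files under datas/commongen;
# the real layout is whatever src/grounding_image/crawl_image.py writes.
from pathlib import Path

root = Path("datas/commongen")
images = [p for ext in ("*.jpg", "*.png") for p in root.rglob(ext)]
print(f"found {len(images)} crawled images under {root}")
```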
Extract the image and text features in advance and store them in .lmdb format to speed up training. The following scripts produce two folders, blip2_image_feats.lmdb and blip2_text_feats.lmdb, under ./datas/commongen/:
#image
python src/grounding_image/save_feat2lmdb_commongen.py
#text
python src/grounding_text/save_feat2lmdb_commongen.py
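If you want to inspect the cached features afterwards, the sketch below shows one way to read them back with the lmdb package. The key format and the float32 dtype are assumptions for illustration; the authoritative encoding is whatever save_feat2lmdb_commongen.py writes.
```python
# Illustrative reader for the cached features, not part of the released code.
import lmdb
import numpy as np

env = lmdb.open("datas/commongen/blip2_image_feats.lmdb", readonly=True, lock=False)
with env.begin() as txn:
    for i, (key, value) in enumerate(txn.cursor()):
        # dtype/shape are assumptions; check save_feat2lmdb_commongen.py for the real encoding
        feat = np.frombuffer(value, dtype=np.float32)
        print(key.decode(), feat.shape)
        if i >= 4:  # peek at the first five entries only
            break
env.close()
```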
The bash script for training:
bash scripts/train_more.sh
The outputs will be saved to the directory specified by --output_dir, i.e., ./res/more.
Information about evaluation can be found in the CommonGen GitHub repository.
If you find our work helpful, please consider citing it as follows:
@inproceedings{Cui2024MOREMR,
    title = "{MORE}: Multi-m{O}dal {RE}trieval Augmented Generative Commonsense Reasoning",
    author = "Cui, Wanqing and
      Bi, Keping and
      Guo, Jiafeng and
      Cheng, Xueqi",
    booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand and virtual meeting",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-acl.69",
    doi = "10.18653/v1/2024.findings-acl.69",
    pages = "1178--1192",
}
The code is licensed under the MIT license and the crawled dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.