OneEncoder for text, image and audio with temperature=2.5.

Browse files

Files changed (2) hide show

README.md +3 -36
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -2,41 +2,8 @@
 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
-language:
-- en
-base_model:
-- google/vit-base-patch16-224
-- google-bert/bert-base-uncased
-- facebook/wav2vec2-base-960h
 ---
-# 🖼️📝 OneEncoder: A Unified Text & Image & Audio Model
-**OneEncoder** is a lightweight framework for cross-modal alignment, focusing on efficiently integrating **text, image and audio** (with future extensions to other modalities). Unlike traditional methods relying on massive modality-specific encoders, OneEncoder progressively aligns different data types, making it cost-effective and performant even on small paired datasets.
-## 🚀 Key Features
-✅ **Multimodal Alignment**: Initially supports **text & image & audio**, with extension to other modalities.
-✅ **Lightweight & Efficient**: Avoids full retraining when adding new modalities.
-✅ **Superior Performance**: Outperforms models that require large specialized datasets.
-## 🎯 Applications
-- **Visual Question Answering (VQA)**
-- **Image-Text-Audio Retrieval**
-- **Multimodal Content Understanding**
-## 📄 Research Paper
-📜 **arXiv**: [OneEncoder: Progressive Cross-Modal Alignment](https://arxiv.org/abs/2409.11059)
-## 📌 Resources
-🔗 **GitHub Repo**: [OneEncoder](https://github.com/b-faye/OneEncoder)
-🚀 **Hugging Face Demo**: [OneEncoder Retriever](https://huggingface.co/spaces/bilalfaye/OneEncoder-retriever)
-📓 **Demo Notebook**: [OneEncoder Demos](https://github.com/b-faye/OneEncoder/tree/main/demo)
-🔊 **OneEncoder for Text, Image**: [HF Model](https://huggingface.co/bilalfaye/OneEncoder-text-image)
-🔊 **OneEncoder for Text, Image & Video**: [HF Model](https://huggingface.co/bilalfaye/OneEncoder-text-image-video)
-🔊 **OneEncoder for Text, Image & X-ray**: [HF Model](https://huggingface.co/bilalfaye/OneEncoder-text-image-xray)
-## 📝 Authors
-📌 **Bilal FAYE**, Hanane AZZAG, Mustapha LEBBAH, Djamel BOUCHAFFRA
-**Note: This model is training with temperature=2.5 and addition as fusion operation**

 tags:
 - model_hub_mixin
 - pytorch_model_hub_mixin
 ---
+This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+- Library: [More Information Needed]
+- Docs: [More Information Needed]

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:01a06ebec4744b25977481615b0524d2dc335db0d1513f55b11b3083a42cef29
 size 1172721276

 version https://git-lfs.github.com/spec/v1
+oid sha256:530a009e0c27f814bdef9b0092dc96a3194b90928fab31cabed0140a719789a7
 size 1172721276