Repository navigation

#

cross-attention

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

Jupyter Notebook
1339
3 年前
Python
1281
7 个月前
unum-cloud/uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Python
1161
2 个月前

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Python
405
6 个月前

🚀 Cross attention map tools for huggingface/diffusers

Python
327
7 个月前

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

Python
177
1 年前
Python
141
1 年前

This is the project for the paper of "Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition" in IJCAI2025

Python
82
21 天前

[IV 2025, Oral] Official code of "6Img-to-3D: Few-Image Large-Scale Outdoor Novel View Synthesis"

Python
77
2 个月前

Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.

Python
73
4 年前

This is the implementation of the paper Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions

Python
65
2 个月前

[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"

Python
53
2 年前

The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".

Python
50
1 年前

[ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection

Python
38
2 年前

A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)

Python
37
3 年前

Tensorflow implementation of 'Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning'

Jupyter Notebook
18
2 个月前

[ICIP 2025] Official implementation of RT-X Net: RGB-Thermal cross attention network for Low-Light Image Enhancement

Python
9
1 个月前

SOVL System (Self-Organizing Virtual Lifeform): A complex, purpose-agnostic autonomous agent with continuous, asynchronous learning capabilities via a dynamic scaffolded LLM and a frozen base LLM

Python
9
3 个月前

Transcription factor binding site prediction for novel DNA sequence data aiding in mutation identification and drug discovery

Jupyter Notebook
7
1 年前

TGRS: Code for "Unsupervised Hybrid Network of Transformer and CNN for Blind Hyperspectral and RGB Image Fusion"

Python
7
1 年前