Repository navigation

cross-attention

Website
Wikipedia

bloc97 / CrossAttentionControl

Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion

cross-attention 深度学习 diffusion-models stable-diffusion

Jupyter Notebook

1341

3 年前

autonomousvision / unimatch

[TPAMI'23] Unifying Flow, Stereo and Depth Estimation

cross-attention depth stereo transformer matching unified-model optical-flow correspondence

Python

1300

130

9 个月前

unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

huggingface-transformers language-vision multimodal PyTorch semantic-search transformer cross-attention vector-search bert 神经网络 pretrained-models multi-lingual clip openai contrastive-learning representation-learning clustering image-search llava

Python

1179

1 个月前

HaozheLiu-ST / T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

cross-attention diffusers diffusion efficiency inference PyTorch text2image transformer

Python

407

7 个月前

wooyeolbaek / attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

cross-attention diffusers visualization huggingface stable-diffusion text-to-image

Python

347

9 个月前

lucidrains / CALM-pytorch

Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind

人工智能 attention-mechanisms cross-attention 深度学习 transformers

Python

179

1 年前

aliasgharkhani / SLiMe

1-shot image segmentation using Stable Diffusion

cross-attention image-segmentation Python PyTorch self-attention stable-diffusion

Python

141

2 年前

xiaogang00 / LLVE_STCD

This is the project for the paper of "Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition" in IJCAI2025

cross-attention

Python

2 个月前

continental / 6Img-to-3D

[IV 2025, Oral] Official code of "6Img-to-3D: Few-Image Large-Scale Outdoor Novel View Synthesis"

cross-attention novel-view-synthesis

Python

1 个月前

laowu-code / iTansformer_LSTM_CA_KAN

This is the implementation of the paper Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions

cross-attention forecasting lstm multivariate optuna pv solar

Python

4 个月前

akashe / Multimodal-action-recognition

Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.

multimodal-deep-learning multimodality multimodal-learning cross-attention

Python

4 年前

KHU-VLL / CAST

[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"

cross-attention representation-learning Video

Python

2 年前

EnergyAttention / Energy-Based-CrossAttention

The official repository of "Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models".

cross-attention energy-based-model PyTorch stable-diffusion

Python

2 年前

timbroed / HRFuser

[ITSC-2023] HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection

机器视觉 cross-attention object-detection sensor-fusion transformer

Python

2 年前

augustwester / transformer-xl

A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)

transformer transformer-xl PyTorch xlnet cross-attention 自然语言处理 self-attention

Python

3 年前

cent664 / SSRIW

Tensorflow implementation of 'Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning'

cross-attention 深度学习 self-supervised-learning

Jupyter Notebook

4 个月前

jhakrraman / rt-xnet

[ICIP 2025] Official implementation of RT-X Net: RGB-Thermal cross attention network for Low-Light Image Enhancement

机器视觉 image-enhancement image-restoration cross-attention multimodal multimodal-learning transformers

Python

2 个月前

ameencaslam / deepfake-detection-project-v4

Detect Deepfaked Faces Using Multiple Deeplearning Models

人工智能 classification cross-attention deepfake-detection 深度学习 efficientnet Open Source swin-transformer

Python

10 个月前

lanl / EPBD-BERT

Transcription factor binding site prediction for novel DNA sequence data aiding in mutation identification and drug discovery

cross-attention multi-modal

Jupyter Notebook

1 年前

oppolla / Self-Organizing-Virtual-Lifeform

SOVL System (Self-Organizing Virtual Lifeform): A complex, purpose-agnostic autonomous agent with continuous, asynchronous learning capabilities via a dynamic scaffolded LLM and a frozen base LLM

autonomous-agents cross-attention

Python

4 个月前