https://github.com/animesh/neighborhood-attention-transformer
[Preprint] Neighborhood Attention Transformer, 2022
https://github.com/animesh/neighborhood-attention-transformer
Science Score: 10.0%
This score indicates how likely this project is to be science-related based on various indicators:
-
○CITATION.cff file
-
○codemeta.json file
-
○.zenodo.json file
-
○DOI references
-
✓Academic publication links
Links to: arxiv.org -
○Academic email domains
-
○Institutional organization owner
-
○JOSS paper metadata
-
○Scientific vocabulary similarity
Low similarity (6.9%) to scientific vocabulary
Last synced: 10 months ago
·
JSON representation
Repository
[Preprint] Neighborhood Attention Transformer, 2022
Basic Info
- Host: GitHub
- Owner: animesh
- License: mit
- Default Branch: main
- Homepage: https://arxiv.org/abs/2204.07143
- Size: 24.4 MB
Statistics
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
- Releases: 0
Fork of SHI-Labs/Neighborhood-Attention-Transformer
Created almost 4 years ago
· Last pushed almost 4 years ago
https://github.com/animesh/Neighborhood-Attention-Transformer/blob/main/
# Neighborhood Attention Transformers![]()
[
](NATTEN.md)   **Powerful hierarchical vision transformers based on sliding window attention.** Neighborhood Attention (NA, local attention) was introduced in our original paper, [NAT](NAT.md), and runs efficiently with our CUDA extension to PyTorch, [NATTEN](NATTEN.md). We recently introduced a new model, [DiNAT](DiNAT.md), which extends NA by dilating neighborhoods (DiNA, sparse global attention). Combinations of NA/DiNA are capable of preserving locality, expanding the receptive field exponentially, and capturing longer-range inter-dependencies, leading to significant performance boosts in downstream vision tasks. # Dilated Neighborhood Attention :fire:   A new hierarchical vision transformer based on Neighborhood Attention (local attention) and Dilated Neighborhood Attention (sparse global attention) that enjoys significant performance boost in downstream tasks. Check out the [DiNAT README](DiNAT.md). # Neighborhood Attention Transformer   Our original paper, [Neighborhood Attention Transformer (NAT)](NAT.md), the first efficient sliding-window local attention. # How Neighborhood Attention works Neighborhood Attention localizes the query token's (red) receptive field to its nearest neighboring tokens in the key-value pair (green). This is equivalent to dot-product self attention when the neighborhood size is identical to the image dimensions. Note that the edges are special (edge) cases.   # News ### September 29, 2022 * New preprint: [Dilated Neighborhood Attention Transformer](DiNAT.md). * [NA CUDA extension v0.13](NATTEN.md) released with dilation support! * See [changelog](CHANGELOG.md). ### July 9, 2022 * [NA CUDA extension v0.12](NATTEN.md) released. * NA runs much more efficiently now, up to 40% faster and uses up to 25% less memory compared to Swin Transformers Shifted Window Self Attention. * Improved FP16 throughput. * Improved training speed and stability. * See [changelog](CHANGELOG.md). ### May 12, 2022 * [1-D Neighborhood Attention](NATTEN.md) support added! * Moved the kernel to `natten/` now, since there's a single version for all three tasks, and we're adding more features to the extension. ### April 30, 2022 * [NA CUDA extension v0.11](NATTEN.md) released. * It's faster in both training and inference, * with a single version for all three tasks (no downstream-specific version) * [PyTorch implementation](NATTEN.md) released * Works both with and without CUDA, but not very efficient. Try to use the CUDA extension when possible. * See [changelog](CHANGELOG.md). # Catalog - [x] Neighborhood Attention 1D (CUDA) - [x] Neighborhood Attention 2D (CUDA) - [ ] Neighborhood Attention 1D (PyTorch) - [x] Neighborhood Attention 2D (PyTorch) - [x] Dilation support - [ ] BFloat16 support (coming soon) - [ ] Zeros/Valid padding support (coming soon) - [ ] HuggingFace Demo # Citation ```bibtex @article{hassani2022neighborhood, title = {Neighborhood Attention Transformer}, author = {Ali Hassani and Steven Walton and Jiachen Li and Shen Li and Humphrey Shi}, year = 2022, url = {https://arxiv.org/abs/2204.07143}, eprint = {2204.07143}, archiveprefix = {arXiv}, primaryclass = {cs.CV} } @article{hassani2022dilated, title = {Dilated Neighborhood Attention Transformer}, author = {Ali Hassani and Humphrey Shi}, year = 2022, url = {https://arxiv.org/abs/2209.15001}, eprint = {2209.15001}, archiveprefix = {arXiv}, primaryclass = {cs.CV} } ```
Owner
- Name: Ani
- Login: animesh
- Kind: user
- Location: Norway
- Company: Norwegian University of Science and Technology
- Website: https://www.fuzzylife.org
- Twitter: animesh1977
- Repositories: 749
- Profile: https://github.com/animesh
A medical graduate from Delhi University with post-graduation in bioinformatics from Jawaharlal Nehru University, India.