memoria-pytorch

Memoria is a human-inspired memory architecture for neural networks.

https://github.com/cosmoquester/memoria

Science Score: 41.0%

This score indicates how likely this project is to be science-related based on various indicators:

  • CITATION.cff file
    Found CITATION.cff file
  • codemeta.json file
    Found codemeta.json file
  • .zenodo.json file
  • DOI references
  • Academic publication links
    Links to: arxiv.org
  • Academic email domains
  • Institutional organization owner
  • JOSS paper metadata
  • Scientific vocabulary similarity
    Low similarity (11.7%) to scientific vocabulary

Keywords

human-inspired, memory, transformer
Last synced: 6 months ago

Repository

Memoria is a human-inspired memory architecture for neural networks.

Basic Info
Statistics
  • Stars: 74
  • Watchers: 5
  • Forks: 5
  • Open Issues: 0
  • Releases: 2
Topics
human-inspired, memory, transformer
Created about 3 years ago · Last pushed over 1 year ago
Metadata Files
Readme · License · Citation

README.md

Memoria

License: MIT · Code style: black · Imports: isort · CircleCI · codecov

Making neural networks remember over the long term has been a longstanding issue. Although several external memory techniques have been introduced, most focus on retaining recent information in the short term. Regardless of its importance, information tends to be fatefully forgotten over time. We present Memoria, a memory system for artificial neural networks, drawing inspiration from humans and applying various neuroscientific and psychological theories. The experimental results prove the effectiveness of Memoria in the diverse tasks of sorting, language modeling, and classification, surpassing conventional techniques. Engram analysis reveals that Memoria exhibits the primacy, recency, and temporal contiguity effects which are characteristics of human memory.

Memoria is an independent module that can be applied to neural network models in various ways; the experiment code for the paper is in the experiment directory.

My paper Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture was accepted to the International Conference on Machine Learning (ICML) 2024 as a Spotlight paper. The full text is available on OpenReview or arXiv.
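Since Memoria exposes a small imperative API (add_working_memory, remind, and adjust_lifespan_and_memories, each walked through in the tutorial below), here is a rough, hedged sketch of one way it might sit inside a model loop. The Memoria calls and parameters come from the tutorial; the attention readout and the rule of granting lifespan in proportion to attention mass are illustrative assumptions, not the paper's exact integration.

```python
# A minimal sketch, NOT the paper's exact integration: the Memoria calls and
# parameters follow the tutorial below, while the attention readout and the
# "attention mass as lifespan delta" rule are illustrative assumptions.
import torch
import torch.nn.functional as F
from memoria import Memoria

memoria = Memoria(
    num_reminded_stm=4,
    stm_capacity=16,
    ltm_search_depth=5,
    initial_lifespan=3,
    num_final_ltms=4,
)

def memory_step(hidden: torch.Tensor) -> torch.Tensor:
    """Process one segment of hidden states with shape [batch, seq_len, dim]."""
    memoria.add_working_memory(hidden)
    reminded, indices = memoria.remind()  # engrams recalled from STM/LTM
    if reminded.size(1) > 0:
        # Attend from the current hidden states to the reminded engrams.
        scores = hidden @ reminded.transpose(1, 2) / hidden.size(-1) ** 0.5
        attn = F.softmax(scores, dim=-1)
        hidden = hidden + attn @ reminded
        # Illustrative rule: engrams that received more attention live longer.
        delta = attn.sum(dim=1)
    else:
        # Nothing reminded yet (e.g. the first segment): pass a zero delta.
        delta = torch.zeros_like(indices, dtype=torch.float)
    memoria.adjust_lifespan_and_memories(indices, delta)
    return hidden

# Feed a few random segments through the memory step.
for _ in range(4):
    out = memory_step(torch.rand(2, 8, 64))
```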

Installation

```sh
$ pip install memoria-pytorch
```

You can install Memoria with the pip command above.
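As an optional sanity check, the public API can be imported directly; the import path is the same one the tutorial below uses.

```python
# Smoke test: these imports should succeed after installation.
from memoria import Memoria, EngramType

print(Memoria.__name__, EngramType.__name__)
```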

Tutorial

This tutorial helps you understand the concept and mechanism of Memoria.

1. Import Memoria and Set Parameters

```python
import torch
from memoria import Memoria, EngramType

torch.manual_seed(42)

# Memoria Parameters
num_reminded_stm = 4
stm_capacity = 16
ltm_search_depth = 5
initial_lifespan = 3
num_final_ltms = 4

# Data Parameters
batch_size = 2
sequence_length = 8
hidden_dim = 64
```

2. Initialize Memoria and Dummy Data

  • Fake random data and lifespan delta are used for simplification.

```python
memoria = Memoria(
    num_reminded_stm=num_reminded_stm,
    stm_capacity=stm_capacity,
    ltm_search_depth=ltm_search_depth,
    initial_lifespan=initial_lifespan,
    num_final_ltms=num_final_ltms,
)
data = torch.rand(batch_size, sequence_length, hidden_dim)
```

3. Add Data as Working Memory

```python
# Add data as working memory
memoria.add_working_memory(data)
```

```python
# Expected values
>>> len(memoria.engrams)
16
>>> memoria.engrams.data.shape
torch.Size([2, 8, 64])
>>> memoria.engrams.lifespan
tensor([[3., 3., 3., 3., 3., 3., 3., 3.],
        [3., 3., 3., 3., 3., 3., 3., 3.]])
```

4. Remind Memories

  • Empty memories are reminded because there are no engrams in STM/LTM yet.

```python
reminded_memories, reminded_indices = memoria.remind()
```

```python
# No reminded memories because there are no STM/LTM engrams yet
>>> reminded_memories
tensor([], size=(2, 0, 64))
>>> reminded_indices
tensor([], size=(2, 0), dtype=torch.int64)
```

5. Adjust Lifespan and Memories

  • In this step, no engrams gain lifespan because there are no reminded memories.

```python
memoria.adjust_lifespan_and_memories(reminded_indices, torch.zeros_like(reminded_indices))
```

```python
# Lifespan decreases for all engrams & working memories have changed into short-term memory
>>> memoria.engrams.lifespan
tensor([[2., 2., 2., 2., 2., 2., 2., 2.],
        [2., 2., 2., 2., 2., 2., 2., 2.]])
>>> memoria.engrams.engrams_types  # 2 == EngramType.SHORTTERM
tensor([[2, 2, 2, 2, 2, 2, 2, 2],
        [2, 2, 2, 2, 2, 2, 2, 2]], dtype=torch.uint8)
```

6. Repeat one more time

  • Now that there are some engrams in STM, reminding and adjustment from STM will work.

```python
data2 = torch.rand(batch_size, sequence_length, hidden_dim)
memoria.add_working_memory(data2)
```

```python
>>> len(memoria.engrams)
32
>>> memoria.engrams.lifespan
tensor([[2., 2., 2., 2., 2., 2., 2., 2., 3., 3., 3., 3., 3., 3., 3., 3.],
        [2., 2., 2., 2., 2., 2., 2., 2., 3., 3., 3., 3., 3., 3., 3., 3.]])
```

```python
reminded_memories, reminded_indices = memoria.remind()
```

```python
# Remind memories from STM
>>> reminded_memories.shape
torch.Size([2, 6, 64])
>>> reminded_indices.shape
torch.Size([2, 6])
>>> reminded_indices
tensor([[ 0,  6,  4,  3,  2, -1],
        [ 0,  7,  6,  5,  4, -1]])
```

```python
# Increase the lifespan of all the reminded engrams by 5
memoria.adjust_lifespan_and_memories(reminded_indices, torch.full_like(reminded_indices, 5))
```

```python
# Reminded engrams gained 5 lifespan; the other engrams have gotten older
>>> memoria.engrams.lifespan
tensor([[6., 1., 6., 6., 6., 1., 6., 1., 2., 2., 2., 2., 2., 2., 2., 2.],
        [6., 1., 1., 1., 6., 6., 6., 6., 2., 2., 2., 2., 2., 2., 2., 2.]])
```

7. Repeat

  • Repeat 10 times to see the dynamics of LTM

```python
# This is the default process to utilize Memoria
for _ in range(10):
    data = torch.rand(batch_size, sequence_length, hidden_dim)
    memoria.add_working_memory(data)

    reminded_memories, reminded_indices = memoria.remind()

    lifespan_delta = torch.randint_like(reminded_indices, 0, 6).float()

    memoria.adjust_lifespan_and_memories(reminded_indices, lifespan_delta)
```

```python
# After 10 iterations, some engrams have changed into long-term memory and gained large lifespans
# Engram type zero means those engrams are deleted
>>> len(memoria.engrams)
72
>>> memoria.engrams.engrams_types  # 3 == EngramType.LONGTERM, 0 == EngramType.NULL
tensor([[3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 2, 2, 2, 2,
         2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2],
        [0, 0, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 2, 2, 2, 2,
         2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2]], dtype=torch.uint8)
>>> memoria.engrams.lifespan
tensor([[ 9.,  1.,  8.,  2., 16.,  5., 13.,  7.,  7.,  3.,  3.,  4.,  3.,  3.,
          4.,  2.,  2.,  1.,  1.,  1.,  1.,  1.,  1.,  1.,  2.,  6.,  1.,  1.,
          2.,  2.,  2.,  2.,  2.,  2.,  2.,  2.],
        [-1., -1.,  3.,  2., 19., 21., 11.,  6., 14.,  1.,  5.,  1.,  5.,  1.,
          5.,  1.,  1.,  8.,  2.,  1.,  1.,  1.,  2.,  1.,  1.,  1.,  1.,  1.,
          2.,  2.,  2.,  2.,  2.,  2.,  2.,  2.]])
```

Citation

```bibtex
@InProceedings{pmlr-v235-park24a,
  title = {Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture},
  author = {Park, Sangjun and Bak, Jinyeong},
  booktitle = {Proceedings of the 41st International Conference on Machine Learning},
  pages = {39587--39615},
  year = {2024},
  editor = {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = {235},
  series = {Proceedings of Machine Learning Research},
  month = {21--27 Jul},
  publisher = {PMLR},
  pdf = {https://raw.githubusercontent.com/mlresearch/v235/main/assets/park24a/park24a.pdf},
  url = {https://proceedings.mlr.press/v235/park24a.html},
  abstract = {Making neural networks remember over the long term has been a longstanding issue. Although several external memory techniques have been introduced, most focus on retaining recent information in the short term. Regardless of its importance, information tends to be fatefully forgotten over time. We present Memoria, a memory system for artificial neural networks, drawing inspiration from humans and applying various neuroscientific and psychological theories. The experimental results prove the effectiveness of Memoria in the diverse tasks of sorting, language modeling, and classification, surpassing conventional techniques. Engram analysis reveals that Memoria exhibits the primacy, recency, and temporal contiguity effects which are characteristics of human memory.}
}
```

Owner

  • Name: ParkSangJun
  • Login: cosmoquester
  • Kind: user
  • Location: Seoul, Korea
  • Company: @scatterlab @pingpong-ai

Machine Learning Engineer @scatterlab Korea. Thank you.

Citation (CITATION.bib)

@InProceedings{pmlr-v235-park24a,
  title = 	 {Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture},
  author =       {Park, Sangjun and Bak, Jinyeong},
  booktitle = 	 {Proceedings of the 41st International Conference on Machine Learning},
  pages = 	 {39587--39615},
  year = 	 {2024},
  editor = 	 {Salakhutdinov, Ruslan and Kolter, Zico and Heller, Katherine and Weller, Adrian and Oliver, Nuria and Scarlett, Jonathan and Berkenkamp, Felix},
  volume = 	 {235},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {21--27 Jul},
  publisher =    {PMLR},
  pdf = 	 {https://raw.githubusercontent.com/mlresearch/v235/main/assets/park24a/park24a.pdf},
  url = 	 {https://proceedings.mlr.press/v235/park24a.html},
  abstract = 	 {Making neural networks remember over the long term has been a longstanding issue. Although several external memory techniques have been introduced, most focus on retaining recent information in the short term. Regardless of its importance, information tends to be fatefully forgotten over time. We present Memoria, a memory system for artificial neural networks, drawing inspiration from humans and applying various neuroscientific and psychological theories. The experimental results prove the effectiveness of Memoria in the diverse tasks of sorting, language modeling, and classification, surpassing conventional techniques. Engram analysis reveals that Memoria exhibits the primacy, recency, and temporal contiguity effects which are characteristics of human memory.}
}

GitHub Events

Total
  • Release event: 1
  • Watch event: 19
  • Delete event: 5
  • Issue comment event: 2
  • Push event: 7
  • Pull request event: 10
  • Fork event: 2
  • Create event: 5
Last Year
  • Release event: 1
  • Watch event: 19
  • Delete event: 5
  • Issue comment event: 2
  • Push event: 7
  • Pull request event: 10
  • Fork event: 2
  • Create event: 5

Packages

  • Total packages: 1
  • Total downloads:
    • pypi: 17 last month
  • Total dependent packages: 0
  • Total dependent repositories: 0
  • Total versions: 2
  • Total maintainers: 1
pypi.org: memoria-pytorch

Memoria is a human-inspired memory architecture for neural networks.

  • Versions: 2
  • Dependent Packages: 0
  • Dependent Repositories: 0
  • Downloads: 17 Last month
Rankings
Dependent packages count: 7.3%
Average: 37.9%
Dependent repos count: 68.5%
Maintainers (1)
Last synced: 7 months ago