Hyena Hierarchy appears to be intended as a drop-in replacement for attention: https://arxiv.org/pdf/2302.10866.pdf
It looks good on paper, but I haven't been able to find anyone actually using it in a model. Does anyone have an example of code or an implementation? Is there really a big improvement at long context lengths?
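For context on where the claimed long-context win comes from: Hyena replaces the quadratic attention matrix with implicitly parameterized long convolutions plus elementwise gating, and the convolutions can be evaluated in O(L log L) with FFTs. Below is a minimal sketch of that core idea, not the paper's actual architecture; the function names, the random filters, and the order-2 recurrence are all my own simplifications for illustration.

```python
import numpy as np

def fft_long_conv(u, k):
    # Linear (causal) convolution of a length-L signal with a length-L
    # filter via FFT: O(L log L) instead of O(L^2).
    L = u.shape[0]
    n = 2 * L  # zero-pad so circular convolution equals linear convolution
    y = np.fft.irfft(np.fft.rfft(u, n=n) * np.fft.rfft(k, n=n), n=n)
    return y[:L]

def hyena_like_operator(x, filters, projections):
    # Simplified Hyena-style recurrence on x of shape (L, D):
    # project the input, then alternate elementwise gating (data-controlled)
    # with long convolutions (one learned filter per step).
    v = x @ projections[0]
    for k, W in zip(filters, projections[1:]):
        gate = x @ W  # gating branch, shape (L, D)
        conv = np.stack(
            [fft_long_conv(v[:, d], k) for d in range(v.shape[1])], axis=1
        )
        v = gate * conv
    return v

rng = np.random.default_rng(0)
L, D = 64, 8
x = rng.standard_normal((L, D))
filters = [rng.standard_normal(L) * 0.1 for _ in range(2)]
projections = [rng.standard_normal((D, D)) * 0.1 for _ in range(3)]
y = hyena_like_operator(x, filters, projections)
print(y.shape)  # (64, 8) -- same shape in and out, like attention
```

In the real paper the filters are generated by a small network (positional encodings through an MLP) rather than stored as explicit length-L vectors, which is what makes the parameter count independent of sequence length.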
My research area is time series forecasting and unsupervised anomaly detection, but it is SOMEWHAT related to NLP.
Papers With Code lists a few potential implementations: https://paperswithcode.com/paper/hyena-hierarchy-towards-larger-convolutional
I am always skeptical of papers. The results may be genuinely good, but how much was the experimental setup tuned so the numbers look good on paper?