CLASS_VISION Telegram 574
Forwarded from Tensorflow(@CVision) (Alireza Akhavan)
#مقاله

The Evolved Transformer
The Evolved Transformer: They perform architecture search on Transformer's stackable cells for seq2seq tasks. “A much smaller, mobile-friendly, Evolved Transformer with only ~7M parameters outperforms the original Transformer by 0.7 BLEU on WMT14 EN-DE.”
https://arxiv.org/abs/1901.11117

The Evolved Transformer is twice as efficient as the Transformer in FLOPS without loss in quality.

#seq2seq



tgoop.com/class_vision/574
Create:
Last Update:

#مقاله

The Evolved Transformer
The Evolved Transformer: They perform architecture search on Transformer's stackable cells for seq2seq tasks. “A much smaller, mobile-friendly, Evolved Transformer with only ~7M parameters outperforms the original Transformer by 0.7 BLEU on WMT14 EN-DE.”
https://arxiv.org/abs/1901.11117

The Evolved Transformer is twice as efficient as the Transformer in FLOPS without loss in quality.

#seq2seq

BY 🚀 کلاس‌ویژن | یادگیری هوش مصنوعی از پایه تا پیشرفته




Share with your friend now:
tgoop.com/class_vision/574

View MORE
Open in Telegram


Telegram News

Date: |

Image: Telegram. Informative While the character limit is 255, try to fit into 200 characters. This way, users will be able to take in your text fast and efficiently. Reveal the essence of your channel and provide contact information. For example, you can add a bot name, link to your pricing plans, etc. 5Telegram Channel avatar size/dimensions Content is editable within two days of publishing
from us


Telegram 🚀 کلاس‌ویژن | یادگیری هوش مصنوعی از پایه تا پیشرفته
FROM American