Skip to content
Machine Learning Explained, Machine Learning Tutorials

MachineCurve

  • About MachineCurve
  • Articles
    • Deep learning
    • Other ML techniques
    • Frameworks
    • Applied AI
  • Ask Questions
  • Collections
    • About the Collections project
    • Dissecting Deep Learning (work in progress)
    • Mastering TensorFlow & Keras
    • Mastering PyTorch
    • Mastering Scikit-learn
    • Generative Adversarial Networks
    • Transformers
  • Newsletter

Tag: transformer

DialoGPT: Transformers for Dialogues

DialoGPT: Transformers for Dialogues

Chris16 March 202130 March 2021Leave a comment
DialoGPT is “a tunable gigaword-scale neural network model for generation of conversational responses, trained on Reddit data”. It uses a...
Read More
Transformers for Long Text: Code Examples with Longformer

Transformers for Long Text: Code Examples with Longformer

Chris12 March 202130 March 2021Leave a comment
Transformer models have been boosting NLP for a few years now. Every now and then, new additions make them even...
Read More
Longformer: Transformers for Long Sequences

Longformer: Transformers for Long Sequences

Chris11 March 202130 March 2021Leave a comment
Transformers have really changed the NLP world, in part due to their self-attention component. But this component is problematic in...
Read More
The TAPAS Transformer: Table Parsing with BERT

The TAPAS Transformer: Table Parsing with BERT

Chris5 March 202130 March 2021Leave a comment
TAPAS (Table Parser) is a weakly supervised Transformer-based question answering model that reasons over tables without generating logical forms. Instead,...
Read More
Easy Causal Language Modeling with Machine Learning and HuggingFace Transformers

Easy Causal Language Modeling with Machine Learning and HuggingFace Transformers

Chris3 March 202130 March 2021Leave a comment
Machine Learning in NLP is making a lot of progress. It can be used for many language tasks, primarily thanks...
Read More
What is ConvBERT and how does it work?

What is ConvBERT and how does it work?

Chris26 February 202130 March 2021Leave a comment
Convolutional BERT (ConvBERT) improves the original BERT by replacing some Multi-headed Self-attention segments with cheaper and naturally local operations, so-called...
Read More
What is the T5 Transformer and how does it work?

What is the T5 Transformer and how does it work?

Chris15 February 202130 March 2021Leave a comment
The Text-to-Text Transfer Transformer or T5 is a type of Transformer that is capable of being trained on a variety of tasks with a...
Read More
Visualizing Transformer behavior with Ecco

Visualizing Transformer behavior with Ecco

Chris19 January 202120 January 2021Leave a comment
These days, Transformer based architectures are taking the world of Natural Language Processing by storm. What’s more, even more recently,...
Read More
DALL·E: OpenAI GPT-3 model can draw pictures based on text

DALL·E: OpenAI GPT-3 model can draw pictures based on text

Chris5 January 20215 January 20211 Comment
In 2020, the GPT-3 model created by OpenAI created big headlines: it was capable of generating text that could not...
Read More
Intuitive Introduction to BERT

Intuitive Introduction to BERT

Chris4 January 202113 January 20219 Comments
Transformers are taking the world of NLP by storm. After being introduced in Vaswani et al.’s Attention is all you...
Read More

Posts navigation

1 2

Disclaimer

Although we make every effort to always display relevant, current and correct information, we cannot guarantee that the information meets these characteristics.

Privacy Policy

Stay up to date about ML developments 👨‍🎓

We post new blogs every week. Sign up to learn new things and better understand concepts you already know. We send emails every Friday.

By signing up, you consent that any information you receive can include services and special offers by email.

Follow MachineCurve.com

MachineCurve
Proudly powered by WordPress | Theme: refur by Crocoblock.