Hugging Face is the central hub of the NLP community. It provides:

  • The Model Hub: A massive repository with tens of thousands of pretrained models for various tasks.
  • The transformers library: A Python library that makes it incredibly easy to download and use these models.
  • Other libraries like datasets, tokenizers, and peft.

For beginners, the single most powerful and easy-to-use tool in the transformers library is the pipeline() function.

The pipeline() Function

The pipeline() is a high-level abstraction that handles all the complex steps of running a model for you. It packages together a pretrained model and its necessary preprocessing steps (like tokenization). All you have to do is specify the task you want to perform and provide the input text.
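Under the hood, a pipeline runs three stages: tokenize the text, run the model, and postprocess the raw outputs into a friendly result. The postprocessing for a sentiment classifier can be sketched in plain Python; the logit values and label names here are made up for illustration, not taken from a real model:

```python
import math

def postprocess(logits, labels=("NEGATIVE", "POSITIVE")):
    """Turn a model's raw output scores (logits) into the
    {'label', 'score'} dict that the pipeline returns."""
    # Softmax: exponentiate and normalize so the scores sum to 1
    exps = [math.exp(x) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Pick the most probable class
    best = max(range(len(probs)), key=lambda i: probs[i])
    return {"label": labels[best], "score": probs[best]}

# Made-up logits, as a classifier might emit for a positive sentence
print(postprocess([-1.2, 3.4]))
```

The pipeline hides all three stages behind a single call, which is exactly what makes it so convenient.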

Let's see it in action. First, make sure you have the library installed:

Bash


pip install transformers
# For PyTorch- or TensorFlow-specific functionality, you may also need one of:
# pip install "transformers[torch]"
# pip install "transformers[tf]"

Example 1: Sentiment Analysis

Want to know if a piece of text is positive or negative? The sentiment-analysis pipeline uses a model fine-tuned for this task.

Python


from transformers import pipeline

# 1. Create the pipeline for the desired task
sentiment_pipeline = pipeline("sentiment-analysis")

# 2. Use the pipeline on your data
data = ["I love the new update, it's amazing!", "This is the worst experience I've ever had."]
results = sentiment_pipeline(data)

# Print the results
for text, result in zip(data, results):
    print(f"Text: '{text}'")
    print(f"Label: {result['label']}, Score: {result['score']:.4f}\n")

The first time you run this, transformers will automatically download and cache the default model and tokenizer for this task (by default under ~/.cache/huggingface), so subsequent runs start much faster.
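Relying on the task's default model is convenient for experimenting, but the default can change between library versions (and recent versions print a warning when no model is named). For reproducible results you can pin the checkpoint explicitly; the one below is, at the time of writing, the default for this task:

```python
from transformers import pipeline

# Pin the exact checkpoint instead of relying on the task default
sentiment_pipeline = pipeline(
    "sentiment-analysis",
    model="distilbert-base-uncased-finetuned-sst-2-english",
)

print(sentiment_pipeline("What a fantastic library!"))
```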

Example 2: Text Generation

Let's use a GPT-2 model to generate some text.

Python


from transformers import pipeline

# Create the text generation pipeline
generator = pipeline("text-generation", model="gpt2") # We can specify a model

# Provide a starting prompt
prompt = "In a world where technology and magic coexisted,"
result = generator(prompt, max_length=50, num_return_sequences=1)  # max_length counts the prompt's tokens too

print(result[0]['generated_text'])
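How the next token is chosen matters as much as the model itself. Generation parameters such as do_sample, temperature, and top_k control this; a minimal, self-contained sketch of top-k sampling over made-up logits (illustrating the idea, not the pipeline's actual implementation) looks like:

```python
import math
import random

def top_k_sample(logits, k, rng=random):
    """Sample a token index from the k highest-scoring logits."""
    # Keep only the k largest logits, discarding the rest entirely
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax over the surviving logits
    exps = [math.exp(logits[i]) for i in top]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Draw one index in proportion to its probability
    r = rng.random()
    cumulative = 0.0
    for idx, p in zip(top, probs):
        cumulative += p
        if r < cumulative:
            return idx
    return top[-1]

# Made-up logits over a 5-token vocabulary; only the top 2 tokens can win
print(top_k_sample([0.1, 2.5, -1.0, 3.0, 0.0], k=2))
```

With k=1 this degenerates to greedy decoding (always the highest logit); larger k trades determinism for variety.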

Example 3: Named Entity Recognition (NER)

NER models identify and classify entities like people, organizations, and locations in text.

Python


from transformers import pipeline

# Create the NER pipeline, merging sub-word tokens into whole entities
ner_pipeline = pipeline("ner", aggregation_strategy="simple")

# Use it on a sentence
text = "My name is John Doe and I work for Google in New York."
entities = ner_pipeline(text)

print(entities)
# Output might look like:
# [
#   {'entity_group': 'PER', 'score': 0.99..., 'word': 'John Doe', ...},
#   {'entity_group': 'ORG', 'score': 0.99..., 'word': 'Google', ...},
#   {'entity_group': 'LOC', 'score': 0.99..., 'word': 'New York', ...}
# ]
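The list of entity dicts is easy to post-process. For example, collecting the recognized words by entity group (the helper name is our own, and the scores below are shortened stand-ins matching the output shape shown above):

```python
from collections import defaultdict

def entities_by_group(entities):
    """Group recognized entity words by their entity_group label."""
    grouped = defaultdict(list)
    for ent in entities:
        grouped[ent["entity_group"]].append(ent["word"])
    return dict(grouped)

# Sample output in the shape shown above (scores shortened)
entities = [
    {"entity_group": "PER", "score": 0.99, "word": "John Doe"},
    {"entity_group": "ORG", "score": 0.99, "word": "Google"},
    {"entity_group": "LOC", "score": 0.99, "word": "New York"},
]
print(entities_by_group(entities))
# {'PER': ['John Doe'], 'ORG': ['Google'], 'LOC': ['New York']}
```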

The pipeline() function supports many other tasks, including translation, summarization, question answering, and more. It is the perfect entry point into the world of applied Transformers, letting you achieve powerful results with minimal code.