TextToolboxTextToolbox
Home/Text Extraction / Analysis/Extract Emails from Text

Extract Emails from Text

Instantly extract all email addresses from any text — paste emails, documents, or HTML and get a clean, deduplicated list you can copy or export as CSV.

📝 Paste Your Text

📧 Extracted Emails

Paste text to extract email addresses.

What Is the Extract Emails from Text Tool?

The Extract Emails from Text tool instantly finds every valid email address hidden in any block of text. Paste an email thread, webpage source, spreadsheet export, or document and all email addresses are listed, deduplicated, and ready to copy or download as CSV — no account needed, all processing happens in your browser.

How to Use the Email Extractor

  1. Paste your text — emails are detected as you type, no button needed.
  2. Enable deduplication — removes repeated addresses and normalises letter case automatically.
  3. Filter by domain — click a badge in the Domain Breakdown panel or type directly in the filter box to show only addresses from one provider or company.
  4. Select individual emails — click rows to select them; use "Copy N Selected" to copy only those addresses.
  5. Copy All or Export CSV — CSV includes an "Email" header row, ready for import into Excel or Google Sheets.

Where Can You Use the Email Extractor?

📣 Marketing Teams
Pull subscriber emails from exported CRM data, feedback forms, or raw CSV dumps before importing into your email platform.
👥 HR & Recruitment
Extract candidate contact emails from application batches, LinkedIn exports, or pasted CV summaries in seconds.
🎧 Customer Support
Compile affected user addresses from log files or ticket dumps to send targeted follow-up communications.
🔬 Research & Analysis
Researchers can extract contact addresses from publications, conference proceedings, or scraped academic pages for outreach.

Email Detection Options and Features

  • Regex-based detection — matches standard RFC 5321 addresses including dots, plus signs, underscores, and hyphens in the local part. Correctly rejects malformed entries like @broken or user@.
  • Deduplication — converts all results to lowercase and removes repeated addresses, giving a clean unique list regardless of formatting inconsistencies.
  • Domain filter — type a domain (e.g. gmail.com) or click a badge in the Domain Breakdown panel to instantly filter to that provider.
  • Domain Breakdown panel — shows every domain found with its email count, sorted by frequency. Click any badge to toggle a filter.
  • Row selection — click individual email rows to select them; "Copy N Selected" copies only chosen addresses.
  • CSV Export — downloads all visible (filtered) emails with an "Email" header row, ready for spreadsheet import.

Best Practices and Limitations

The tool matches the standard local@domain.tld format and handles all TLDs of 2+ characters (.io, .co.uk, .dev, .museum). There are a few edge cases to be aware of:

Obfuscated addresses: Some websites write emails as user [at] domain [dot] com to prevent detection. These will not be matched — the tool only detects the standard @ notation.

Line-wrapped emails: If a very long email address has been split across two lines by an email client or PDF viewer, the extractor may not detect it. Join the lines manually before pasting.

Scanned PDFs: Use your PDF reader to select all text and copy it. Scanned image-based PDFs require OCR first to convert the image to text before emails can be extracted.

Share This Tool

Help teams extract email addresses from text for free!

Frequently Asked Questions

What email formats does the tool detect?+

The tool matches standard RFC 5321 email addresses: local-part@domain.tld where the local part can contain letters, digits, dots, underscores, plus signs, and hyphens. It normalises all results to lowercase and correctly rejects malformed entries like @broken or user@ that lack a complete address structure.

Can I extract emails from a PDF?+

Paste the PDF text content into the tool — most PDF readers allow you to select all text (Ctrl+A) and copy it. The tool then extracts emails from the pasted text. Scanned PDFs (image-based) require OCR first to convert the image to text before emails can be detected.

How does duplicate removal work?+

When "Remove duplicate emails" is enabled, the tool converts all emails to lowercase and then creates a Set — a JavaScript data structure that automatically discards repeated values. This means john@gmail.com and John@Gmail.com are treated as the same address and only one copy is kept.

What is the domain filter for?+

The domain filter shows only emails from a specific domain — e.g. type gmail.com to see only Gmail addresses from your text. Click a domain badge in the Domain Breakdown panel to activate the filter for that domain instantly, then click it again to clear. Useful for segmenting a mixed list by provider or company.

Is my text data sent to a server?+

No. All processing happens entirely in your browser using JavaScript. The text you paste is never uploaded, transmitted, or stored anywhere. You can verify this by turning off your internet connection — the tool continues to work perfectly offline after the page loads.