entity_extractor
Entity Extractor
Main pipe: extract_entities
Install
$ npx mthds install https://github.com/pipelex/methods --method entity_extractor
Run
$ npx mthds run method entity_extractor
Entity Extractor
Extract named entities (persons, organizations, dates, amounts, locations, products, regulations) from any document.
What it does
Given a document (PDF, Word, etc.), this method:
- Extracts content from the document into readable pages
- Identifies all named entities with type classification, surrounding context, and normalized values
Concepts
- Entity: A named entity with name, entity_type, context, and an optional normalized_value
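To make the shape of the Entity concept concrete, here is a minimal Python sketch of a record with the four fields listed above. Only the field names come from this page. The field types, the class layout, and the sample values are assumptions made for illustration, not the method's actual definition.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Entity:
    """Illustrative record mirroring the fields named in the Concepts list."""

    name: str          # the entity as it appears in the document
    entity_type: str   # e.g. a person, organization, or date label (label strings are assumed)
    context: str       # surrounding text in which the entity was found
    normalized_value: Optional[str] = None  # optional canonical form


# Hypothetical sample values, chosen only to show how the fields relate.
signed_on = Entity(
    name="March 3rd, 2024",
    entity_type="date",
    context="The contract was signed on March 3rd, 2024.",
    normalized_value="2024-03-03",
)

# normalized_value is optional, so it can simply be left out.
vendor = Entity(
    name="Acme Corp",
    entity_type="organization",
    context="Acme Corp will deliver the goods.",
)
```

A frozen dataclass is used here so that extracted entities behave as immutable values; that is a design choice of this sketch, not something the page specifies.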
Exports
entity_extraction: extract_entities, identify_entities
Bundle Files
METHODS.toml
[package]
name = "entity_extractor"
display_name = "Entity Extractor"
address = "github.com/pipelex/methods"
version = "0.1.2"
description = "# Entity Extractor\n\nExtract named entities (persons, organizations, dates, amounts, locations, products, regulations) from any document.\n\n## What it does\n\nGiven a **document** (PDF, Word, etc.), this method:\n\n1. **Extracts** content from the document into readable pages\n2. **Identifies** all named entities with type classification, surrounding context, and normalized values\n\n## Concepts\n\n- **Entity**: A named entity with `name`, `entity_type`, `context`, and optional `normalized_value`"
license = "MIT"
authors = ["Evotis S.A.S"]
main_pipe = "extract_entities"
[exports.entity_extraction]
pipes = ["extract_entities", "identify_entities"]