Transform raw product data into complete, structured catalogs. Every field is validated, sourced, and ready for production. Fully compatible with your existing PIM and ERP schemas.

Why Rastro?

Auto-Scaling Infrastructure

Processing 30,000+ SKUs is complex: it means managing queues, handling failures, rate limiting, and scaling workers. Rastro handles all of this for you. Just send your data and we process it at scale.

Complex Sources with Full Traceability

Every enriched value includes sources showing where the data was found, plus an optional explanation of how the value was derived. No black boxes: you can audit any data point back to its origin. Sources can be web URLs, input fields, PDFs, or any data you provide:
{
  "material": {
    "value": "316 Stainless Steel",
    "sources": ["https://mcmaster.com/catalog/91251A"],
    "source_explanation": "Extracted from product title and specifications table"
  },
  "bore_diameter": {
    "value": "25 mm",
    "sources": ["INPUT:product_title"],
    "source_explanation": "Extracted from product title field"
  },
  "datasheet_url": {
    "value": "https://example.com/spec.pdf",
    "sources": ["PDF:spec.pdf:page_2"],
    "source_explanation": "Extracted from specifications table on page 2"
  }
}
Set "web_search": false to skip web lookups and only process your input data.
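
To audit a response like the one above, it can help to group each field's sources by where they came from. The following is a minimal sketch, assuming the source strings follow the prefix patterns seen in the example (`https://...`, `INPUT:...`, `PDF:...`). The helper names `classify_source` and `sources_by_kind` are hypothetical and not part of any Rastro SDK.

```python
from typing import Dict, List


def classify_source(source: str) -> str:
    """Return a coarse kind for a source string, based on its prefix.

    The prefixes are inferred from the example response, not from a
    documented format, so treat them as assumptions.
    """
    if source.startswith(("http://", "https://")):
        return "web"
    if source.startswith("INPUT:"):
        return "input"
    if source.startswith("PDF:"):
        return "pdf"
    return "other"


def sources_by_kind(record: Dict[str, dict]) -> Dict[str, List[str]]:
    """Map each field name to the kinds of sources that back it."""
    return {
        field: [classify_source(s) for s in entry.get("sources", [])]
        for field, entry in record.items()
    }


# Same shape as the example response above.
record = {
    "material": {
        "value": "316 Stainless Steel",
        "sources": ["https://mcmaster.com/catalog/91251A"],
    },
    "bore_diameter": {"value": "25 mm", "sources": ["INPUT:product_title"]},
    "datasheet_url": {
        "value": "https://example.com/spec.pdf",
        "sources": ["PDF:spec.pdf:page_2"],
    },
}

kinds = sources_by_kind(record)
print(kinds)
```

A mapping like this makes it easy to list, for example, every field that relies only on web sources before approving a batch.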

Automatic Error Flagging

AI flags uncertain values, conflicting data, and potential issues for human review. Every result includes reasoning and confidence indicators.
{
  "review_info": {
    "reasoning": "Found conflicting specs across sources",
    "confidence": "medium",
    "flags": ["conflicting_data", "verify_dimensions"]
  }
}
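
One way to act on `review_info` is to route any result that carries flags, or whose confidence is not trusted, to a human review queue. This is a sketch only: the example above shows `"medium"` as a confidence value, and `"high"` as the trusted level is an assumption, as is the hypothetical helper name `needs_review`.

```python
from typing import Any, Dict, Tuple


def needs_review(result: Dict[str, Any], trusted: Tuple[str, ...] = ("high",)) -> bool:
    """Return True when a result should be checked by a person.

    A result needs review if it has any flags, or if its confidence is
    missing or not in the trusted set. "high" is an assumed value.
    """
    info = result.get("review_info", {})
    has_flags = bool(info.get("flags"))
    low_confidence = info.get("confidence") not in trusted
    return has_flags or low_confidence


flagged = {
    "review_info": {
        "reasoning": "Found conflicting specs across sources",
        "confidence": "medium",
        "flags": ["conflicting_data", "verify_dimensions"],
    }
}
clean = {"review_info": {"reasoning": "", "confidence": "high", "flags": []}}

print(needs_review(flagged), needs_review(clean))
```

Treating a missing `review_info` as "needs review" is a deliberately conservative choice; loosen it if your pipeline guarantees the field is present.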

Taxonomy & Category Prediction

Automatically classify items into your taxonomy and extract their attributes. Define your category hierarchy once, and Rastro assigns items to it and extracts category-specific attributes.
{
  "category_path": "Fasteners > Bolts > Hex Bolts",
  "taxonomy_attributes": {
    "Thread Size": "M10-1.5",
    "Head Type": "Hex",
    "Grade": "A2-70"
  }
}
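
Downstream systems often need the category as separate levels rather than a single string. As a small sketch, assuming `category_path` always uses `>` as the separator (as in the example above), the path can be split like this. The function name `split_category_path` is hypothetical.

```python
from typing import List


def split_category_path(path: str, sep: str = ">") -> List[str]:
    """Split a category path into its levels, trimming surrounding spaces."""
    return [part.strip() for part in path.split(sep) if part.strip()]


levels = split_category_path("Fasteners > Bolts > Hex Bolts")
print(levels)       # ['Fasteners', 'Bolts', 'Hex Bolts']
print(levels[-1])   # leaf category: Hex Bolts
```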

Complex Field Types

Define fields with units, enums, arrays, and validation constraints. Rastro extracts structured data matching your exact schema.
{
  "output_schema": [
    {"name": "weight", "type": "number", "unit": "kg"},
    {"name": "material", "type": "string", "enum": ["Steel", "Aluminum", "Brass"]},
    {"name": "certifications", "type": "array", "items_enum": ["ISO 9001", "CE", "UL"]}
  ]
}
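
If you want to double-check results on your side, the same schema can drive a simple client-side validator. The sketch below assumes the three field types shown above (`number`, `string`, `array`) and the `enum` / `items_enum` keys. It does not describe how Rastro validates internally, it skips fields that are absent from the item, and the helper names are hypothetical.

```python
from typing import Any, Dict, List


def validate_field(spec: Dict[str, Any], value: Any) -> List[str]:
    """Return a list of problems for one value; an empty list means it passed."""
    kind = spec["type"]
    if kind == "number":
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            return ["expected a number"]
    elif kind == "string":
        if not isinstance(value, str):
            return ["expected a string"]
        if "enum" in spec and value not in spec["enum"]:
            return [f"{value!r} is not one of {spec['enum']}"]
    elif kind == "array":
        if not isinstance(value, list):
            return ["expected an array"]
        allowed = spec.get("items_enum")
        if allowed is not None:
            return [f"{v!r} is not one of {allowed}" for v in value if v not in allowed]
    return []


def validate_item(schema: List[Dict[str, Any]], item: Dict[str, Any]) -> Dict[str, List[str]]:
    """Validate every field present in the item; return problems keyed by field name."""
    problems: Dict[str, List[str]] = {}
    for spec in schema:
        name = spec["name"]
        if name in item:
            errors = validate_field(spec, item[name])
            if errors:
                problems[name] = errors
    return problems


output_schema = [
    {"name": "weight", "type": "number", "unit": "kg"},
    {"name": "material", "type": "string", "enum": ["Steel", "Aluminum", "Brass"]},
    {"name": "certifications", "type": "array", "items_enum": ["ISO 9001", "CE", "UL"]},
]

good = {"weight": 1.2, "material": "Steel", "certifications": ["CE"]}
bad = {"weight": "heavy", "material": "Titanium", "certifications": ["CE", "RoHS"]}

print(validate_item(output_schema, good))
print(validate_item(output_schema, bad))
```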

Quality Scoring

Get a 1-5 readiness score for each item based on your criteria. Identify incomplete records before they hit production.
{
  "quality_score": 4,
  "quality_result": {
    "explanation": "Complete specs, missing datasheet URL",
    "issues": ["No datasheet link"],
    "suggestions": ["Add manufacturer datasheet"]
  }
}
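
The score can act as a gate before publishing. Here is a minimal sketch that holds back items scoring below a threshold; the threshold of 4 and the `sku` key are illustrative assumptions, and `below_threshold` is a hypothetical helper.

```python
from typing import Any, Dict, List


def below_threshold(items: List[Dict[str, Any]], minimum: int = 4) -> List[Dict[str, Any]]:
    """Return the items whose quality_score is below the minimum.

    Items with no quality_score are treated as score 0 and held back.
    """
    return [item for item in items if item.get("quality_score", 0) < minimum]


items = [
    {"sku": "A-1", "quality_score": 4},
    {"sku": "B-2", "quality_score": 2},
    {"sku": "C-3"},  # not scored yet
]

held_back = [item["sku"] for item in below_threshold(items)]
print(held_back)
```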