mantimon-tcg/backend/data/raw
Cal Corum c6e3695760 Fix card data pipeline: fossil cards and text artifacts
Scraper fixes:
- Detect fossil cards (Helix/Dome Fossil, Old Amber) as Trainer/Item cards
- Add text artifact cleaning for stripped energy icons:
  - 'aEnergy' -> 'an Energy'
  - 'extraEnergy' -> 'extra Energy'
  - 'BenchedPokémon' -> 'Benched Pokémon'
  - And 20+ other common patterns

Converter improvements:
- Add evolution chain validation to detect broken evolves_from references
- Track conversion errors and validation warnings in _index.json
- Return errors from convert_set() for better debugging

Data fixes:
- Fixed 4 fossil cards (now correctly typed as trainer/item)
- Fixed text artifacts in 46 raw card files
- Regenerated all 382 card definitions
- All evolution chains now valid

Added fix_raw_text.py utility script for batch text cleanup.
2026-01-27 14:37:03 -06:00
..
a1 Fix card data pipeline: fossil cards and text artifacts 2026-01-27 14:37:03 -06:00
a1a Fix card data pipeline: fossil cards and text artifacts 2026-01-27 14:37:03 -06:00
_index.json Fix card data pipeline: fossil cards and text artifacts 2026-01-27 14:37:03 -06:00
.gitkeep Add CardService and card data conversion pipeline 2026-01-27 14:16:40 -06:00
README.md Add CardService and card data conversion pipeline 2026-01-27 14:16:40 -06:00

Raw Scraped Data

Scraped from pokemon-zone.com. Reference only - do not edit.

Run scripts/scrape_pokemon_pocket.py to update this data.

For authoritative card definitions used by the game engine, see ../definitions/.

Structure

raw/
├── _index.json          # Index of all scraped cards
├── a1/                  # Genetic Apex set
│   └── *.json           # Individual card files
└── a1a/                 # Mythical Island set
    └── *.json           # Individual card files

Notes

  • This data is the raw output from the scraper
  • Schema may differ from the game engine's CardDefinition model
  • Use scripts/convert_cards.py to transform this data into definitions