Name: Path Metadata Heuristics
Author: jmservera

Context

Use this when deriving title, author, year, and category from book paths before indexing into Solr.

Patterns

Honor explicit filename structure first
- Author - Title (Year).pdf → author/title/year from the filename
- Category/Author - Title (Year).pdf → category from folder, author/title/year from filename
Use folder depth to separate category vs author
- Category/Author/Title.pdf → first folder is category, second folder is author
- Author/Title.pdf → parent folder is author when the filename does not look like a series/journal issue
Handle real aithena library cases
- amades/Auca ... amades.pdf → treat amades as author and strip the repeated author suffix from the title

Use this when deriving title, author, year, and category from book paths before indexing into Solr.

Honor explicit filename structure first
- Author - Title (Year).pdf → author/title/year from the filename
- Category/Author - Title (Year).pdf → category from folder, author/title/year from filename
Use folder depth to separate category vs author
- Category/Author/Title.pdf → first folder is category, second folder is author
- Author/Title.pdf → parent folder is author when the filename does not look like a series/journal issue
Handle real aithena library cases
- amades/Auca ... amades.pdf → treat amades as author and strip the repeated author suffix from the title