Extract academic style, build domain knowledge base (lexicon/domains), and generate style guide from PDF/TeX files.
Goal: Reverse-engineer academic style from PDF/TeX files and build the psmfiles/ knowledge base.
Locate Source Files:
WebSearch to find them first.
Extract Text:
python paper-extract-style/extract_text.py <file_path>
ref_article/*_cleaned_body.md and ref_article/*_appendix.md.
Build Knowledge Base (Incremental Fusion):
ref_article/*_cleaned_body.md.
psmfiles/lexicon_domain.md:
[Method], [Metric]).
LEXICON.md.
psmfiles/DOMAINS_Knowledge.md:
psmfiles/STYLE_GUIDE.md:
paper-extract-style/TEMPLATES.md.
*_appendix.md to define Appendix formatting standards.
Completion:
psmfiles/.
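
The Extract Text and Completion steps above could be scripted roughly as follows. This is a minimal sketch, not the workflow's actual tooling: the script path, the ref_article/ glob patterns, and the three psmfiles/ file names come from the steps above, while the helper names (build_extract_command, extracted_outputs, missing_knowledge_base_files) are hypothetical, and it is an assumption that extract_text.py writes its results into ref_article/.

```python
import sys
from pathlib import Path

# Paths taken from the steps above; the helpers below are illustrative only.
EXTRACT_SCRIPT = Path("paper-extract-style/extract_text.py")
KNOWLEDGE_BASE_FILES = (
    "lexicon_domain.md",
    "DOMAINS_Knowledge.md",
    "STYLE_GUIDE.md",
)


def build_extract_command(source_file):
    """Return the argv list that runs the extraction script on one PDF/TeX file."""
    # sys.executable stands in for the bare "python" in the command above.
    return [sys.executable, str(EXTRACT_SCRIPT), str(source_file)]


def extracted_outputs(ref_dir="ref_article"):
    """Return (body files, appendix files) found in ref_dir, sorted by name."""
    base = Path(ref_dir)
    bodies = sorted(base.glob("*_cleaned_body.md"))
    appendices = sorted(base.glob("*_appendix.md"))
    return bodies, appendices


def missing_knowledge_base_files(psm_dir="psmfiles"):
    """Return the expected knowledge-base files that are not yet in psm_dir."""
    base = Path(psm_dir)
    return [name for name in KNOWLEDGE_BASE_FILES if not (base / name).is_file()]
```

A caller might pass the result of build_extract_command("paper.pdf") to subprocess.run(..., check=True), then use extracted_outputs() to confirm that body and appendix files exist, and treat an empty list from missing_knowledge_base_files() as the completion check. Using sys.executable rather than a literal "python" keeps the script in the same interpreter as the caller.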