Article Extraction
Project Likkutei Sichos | July 29, 2024
Print This Article
View Original PDF

Article Extraction

Project Likkutei Sichos | December 10, 2025

Return a single JSON array of objects. Each object must contain:

  • "title": The article's unique main heading (clean text only, no special characters).
  • "content": Full article text, formatted in HTML using:
    • <p> for paragraphs
    • <strong> for bold
    • <em> for italics
    • <h2>/<h3> for subheadings
    Preserve original formatting and structure. Do NOT truncate, summarize, or add ellipses.
  • "tags": Array of at least 5 relevant tags/keywords derived from article content.
  • "printout_title": The title of the printout document.
  • "source_url": The original printout's URL or reference.
  • "beginning": Integer (1–5). Is the start logical?
  • "ending": Integer (1–5). Does the article end with punctuation or closure?
  • "completeness": Integer (1–5). Is the article a complete thought?
  • "readability": Integer (1–5). Does the article make sense independently?

Return a single JSON array of objects. Each object must contain:

  • "title": The article's unique main heading (clean text only, no special characters).
  • "content": Full article text, formatted in HTML using:
    • <p> for paragraphs
    • <strong> for bold
    • <em> for italics
    • <h2>/<h3> for subheadings
    Preserve original formatting and structure. Do NOT truncate, summarize, or add ellipses.
  • "tags": Array of at least 5 relevant tags/keywords derived from article content.
  • "printout_title": The title of the printout document.
  • "source_url": The original printout's URL or reference.
  • "beginning": Integer (1–5). Is the start logical?
  • "ending": Integer (1–5). Does the article end with punctuation or closure?
  • "completeness": Integer (1–5). Is the article a complete thought?
  • "readability": Integer (1–5). Does the article make sense independently?
PDF Preview