News

Google News
llamaindex. ai > blog > how-to-make-a-pdf-searchable

How to Make a PDF Searchable: Methods and Limits

2+ hour, 26+ min ago  (410+ words) OSS repos trusted by millions of developers What "Searchable" Means: Two Layers, One of Them Invisible The fastest way to make a PDF searchable takes about four clicks in Adobe Acrobat: open the file, run Scan & OCR, recognize text, save....

Symbols: btc-usd,nasdaq:pdfs
Llama Index
llamaindex. ai > glossary > code-block-extraction

What is Code Block Extraction?

6+ day, 8+ hour ago  (306+ words) OSS repos trusted by millions of developers How Code Block Extraction Works Code block extraction targets and isolates code content from within a larger body of text. Rather than processing an entire document as undifferentiated content, extraction logic locates the…...

Symbols: symbol:albld
Llama Index
llamaindex. ai > glossary > header-detection

What is Header Detection?

6+ day, 8+ hour ago  (457+ words) OSS repos trusted by millions of developers What Header Detection Means Across Different Contexts "Detection" in this context means the process by which a system locates, reads, and interprets that structured block'distinguishing it from surrounding content and extracting the information…...

Symbols: tsxv:epl,nyse:www,symbol:since
Llama Index
llamaindex. ai > glossary > bold-and-italic-detection

What is Bold and Italic Detection?

6+ day, 8+ hour ago  (426+ words) OSS repos trusted by millions of developers What Bold and Italic Detection Actually Means Bold and italic detection is the process of identifying text formatted with bold or italic styling within a document, image, or digital file, distinguishing it from…...

Symbols: nse:fsl,btc-usd,eth-usd
Llama Index
llamaindex. ai > glossary > reading-order-detection

What is Reading Order Detection?

6+ day, 8+ hour ago  (585+ words) OSS repos trusted by millions of developers What Reading Order Detection Actually Does Getting this right matters for accessibility compliance, screen reader compatibility, and any downstream process that depends on coherent, logically ordered text. Reading order detection determines the logical…...

Symbols: btc-usd,eth-usd
Llama Index
llamaindex. ai > glossary > nested-table-parsing

What is Nested Table Parsing?

6+ day, 8+ hour ago  (472+ words) OSS repos trusted by millions of developers Nested Tables: Structure and Format Breakdown A nested table is a table that exists inside a cell of another table. The outer table is the parent; the table embedded within one of its…...

Symbols: tpc-ds,htwo.pvt,btc-usd
Llama Index
llamaindex. ai > glossary > latex-extraction-from-pdf

What is La Te X Extraction from PDF?

6+ day, 8+ hour ago  (532+ words) OSS repos trusted by millions of developers The table below provides a side-by-side comparison of the most widely used tools for PDF-to-La Te X extraction. Use it to identify the best fit for your document type and requirements. Teams that…...

Symbols: btc-usd
Llama Index
llamaindex. ai > glossary > merged-cell-extraction

What Is Merged Cell Extraction?

6+ day, 8+ hour ago  (558+ words) OSS repos trusted by millions of developers Why Merged Cells Break Extraction A merged cell is created when two or more adjacent cells in a spreadsheet or table are combined into a single display unit. Visually, the merged cell spans…...

Symbols: nasdaq:clls,ticker:clls,ticker:alcls
Llama Index
llamaindex. ai > blog > building-a-financial-document-pipeline-with-llamaparse

Build a Loan Underwriting Pipeline with Llama Parse

1+ week, 3+ day ago  (483+ words) OSS repos trusted by millions of developers The Workshop Tech Stack Loan underwriting requires pulling data from multiple financial documents. This often includes pay stubs and brokerage statements, all with complex layouts that will vary widely across providers. This is…...

Symbols: btc-usd
Llama Index
llamaindex. ai > glossary > legal-due-diligence-ai

What is Legal Due Diligence AI?

1+ week, 6+ day ago  (658+ words) OSS repos trusted by millions of developers What Legal Due Diligence AI Actually Does Legal Due Diligence AI addresses this directly by applying machine learning, Natural Language Processing (NLP), and optical character recognition (OCR) to automate and accelerate the review…...

Symbols: gpn-ai,nyse:dash,nyse:law,lse:htg