Class PdfDataExtractor - lublak/pdfdataextract GitHub Wiki
the extractor for the data of the pdf
• get fingerprint(): string
get the fingerprint
string
the fingerprint
• get pages(): number
get the number of pages
number
the number of pages
▸ close(): Promise<void>
close the extractor
Promise<void>
a promise that is resolved when destruction is completed
▸ getMetadata(): Promise<null | { info: Info ; metadata: Metadata }>
get the metadata
Promise<null | { info: Info ; metadata: Metadata }>
| null>} a promise that is resolved with a {{ info: Info; metadata: Metadata; } | null} object with information from the metadata section
▸ getOutline(): Promise<null | Outline[]>
get the outline/bookmarks
Promise<null | Outline[]>
a promise that is resolved with a {Outline[]} array with information from the tree outline
▸ getPermissions(): Promise<null | Permissions>
get the permission flags
Promise<null | Permissions>
a promise that is resolved with a {Permissions | null} object that contains the permission flags for the PDF
▸ getText(pages?, sort?): Promise<string[]>
get the text
| Name | Type |
|---|---|
pages? |
number | number[] | (pageNumber: number) => boolean
|
sort? |
boolean |
Promise<string[]>
a promise that is resolved with a {string[]} array with the extracted text per page
▸ Static get(data, options?): Promise<PdfDataExtractor>
get the extractor for the data
| Name | Type | Description |
|---|---|---|
data |
Uint8Array |
the binary data file |
options |
PdfDataExtractorOptions | - |
Promise<PdfDataExtractor>
a promise that is resolved with a {PdfDataExtractor} object to pull the extracted data from