IDocumentInfo

public interface IDocumentInfo

Represents the document information.

The objects which implement this interface are returned by Parser.getDocumentInfo() method. See the usage examples there.

Learn more:

Methods

Method Description
getFileType() Gets the document type.
getPageCount() Gets the total number of document pages.
getRawPageCount() Gets the total number of document raw pages.
getSize() Gets the size of the document in bytes.
getPages() Gets the information about pages such as the index and page size.

getFileType()

public abstract FileType getFileType()

Gets the document type.

Returns: FileType - An instance of FileType class that represents the type of the document.

getPageCount()

public abstract int getPageCount()

Gets the total number of document pages.

Returns: int - An integer value that represents a total number of pages.

getRawPageCount()

public abstract int getRawPageCount()

Gets the total number of document raw pages.

Use getRawPageCount() property instead of getPageCount() property for raw text extraction. Some documents have different page numbers in accurate and raw text extraction modes. getPageCount() property may perform extra calculations which impacts on text extraction speed in raw mode.

Returns: int - An integer value that represents a total number of raw pages.

getSize()

public abstract long getSize()

Gets the size of the document in bytes.

Returns: long - An integer value that represents the size of the document in bytes.

getPages()

public abstract List<PageInfo> getPages()

Gets the information about pages such as the index and page size.

Returns: java.util.List<com.groupdocs.parser.options.PageInfo> - A collection with instances of PageInfo classes.