ParserGetTextAreas Method (Int32) |
Namespace: GroupDocs.Parser
To extract text areas from a document page the following method is used:
// Create an instance of Parser class using(Parser parser = new Parser(filePath)) { // Check if the document supports text areas extraction if(!parser.Features.TextAreas) { Console.WriteLine("Document isn't supports text areas extraction."); return; } // Get the document info IDocumentInfo documentInfo = parser.GetDocumentInfo(); // Check if the document has pages if(documentInfo.PageCount == 0) { Console.WriteLine("Document hasn't pages."); return; } // Iterate over pages for(int pageIndex = 0; pageIndex<documentInfo.PageCount; pageIndex++) { // Print a page number Console.WriteLine(string.Format("Page {0}/{1}", pageIndex + 1, documentInfo.PageCount)); // Iterate over page text areas // We ignore null-checking as we have checked text areas extraction feature support earlier foreach(PageTextArea a in parser.GetTextAreas(pageIndex)) { // Print a rectangle and text area value: Console.WriteLine(string.Format("R: {0}, Text: {1}", a.Rectangle, a.Text)); } } }