Page 1 of 1

Creating modules from accessible pdf documents

Posted: Mon Oct 08, 2012 12:54 pm
by novar
Tim, is it possible to create a module of a pdf that can be searched like the other resources?
Also, how do I view .top files I have purchased?
Thanks
Robert

PS This is a great program

Re: Creating modules from accessible pdf documents

Posted: Mon Oct 08, 2012 1:49 pm
by Tim
Only if you extract the text from the PDF into a formatted text file and then make a Bible Analyzer module from it. See the User Manual on how to format a text file and create the module.

.top file? BA doesn't use that extention. It uses bib, dct, cmt, bk, imd, pll extentions for modules. Do you mean an esword .top file?

Re: Creating modules from accessible pdf documents

Posted: Mon Sep 16, 2013 10:23 pm
by Abigail111
Hi there
I never create modules from accessible pdf documents before.And i don't know the detailed solution for your questions.
But for pdf questions,i always using the pdf processing in c# control.It can solve all kinds of pdf issues.Hope to suitable for you too!

Re: Creating modules from accessible pdf documents

Posted: Wed Jan 20, 2016 10:10 pm
by arronlee
Hi, Abigail111.
Thanks for your nice sharing. As for me, I have seldom tried to
extract the text from the PDF into a formatted text file and then make a Bible Analyzer module from it before. Have you ever worked it out? I wonder whether you have any exprience about pdf extraction process. Because there is something wrong with my pdf reader. I need convert pdf into text or other formats. Any suggestion will be appreciated. Thanks in advance.



Best regards,
Lee

Re: Creating modules from accessible pdf documents

Posted: Tue Jul 20, 2021 4:36 pm
by novar
The only way to extract text files from a pdf is to use Adobe Acrobat... I have Acrbat 11 Pro stand alone. It can convert Pdf docs to RTF (Rich Text Format) then you can convert to BA module. I got the Acrobat 11 Pro from 3rd party seller. You can also convert with Adobe Acrobat 10 pro/standard
Robert

Re: Creating modules from accessible pdf documents

Posted: Sun Jul 25, 2021 3:20 pm
by epement
Novar wrote: "The only way to extract text files from a pdf is to use Adobe Acrobat... I have Acrbat 11 Pro stand alone."

I think that information is a few years out-of-date. It is possible to open PDF files in MIcrosoft Word, LibreOffice, and other tools:

https://support.microsoft.com/en-us/off ... 3379c2f63a

In LibreOffice, PDFs open in Draw with each line a separate element, so you do not get the full "paragraph flow" as in the source document, but you do have access to the full text. See https://www.ghacks.net/2018/02/13/using ... df-editor/

You often don't even have to pay for text extraction. See https://tools.pdfforge.org/extract-text

Or here https://docparser.com/blog/extract-text ... 244291.004

Here are some Python tools: https://dida.do/blog/how-to-extract-text-from-pdf