in RPA by (12.7k points)

I am trying to read a PDF as text. I can read it and write it back out, but the output has junk in it. That is fine, because I have a parser component that picks out the bits I need.

My question is: how can I read only specific parts of the PDF and ignore the rest?

closed

1 Answer

by (29.5k points)
Best answer

If your PDF is well formatted, you can try text scraping. For this to work, the PDF file has to be open and visible on screen, since Native Scraping needs the document to be visible.

PS: This works with Adobe Reader DC only if you have the right settings. Each time you reopen Adobe Reader, you have to open the settings dialog again. The settings may already be correct, but they are not applied until the dialog has been opened.
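Once the text has been scraped, keeping only the parts you need comes down to the parser step mentioned in the question. Here is a minimal sketch of that step in Python, assuming the scraped text is already available as a string. The field names, the labels in the patterns, and the sample text are all hypothetical and only there for illustration; swap in whatever labels actually appear in your PDF.

```python
import re

# Hypothetical field labels (assumption): replace these with the labels
# that actually appear in your PDF.
FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"Invoice\s*(?:No\.?|Number)\s*[:#]?\s*(\S+)", re.IGNORECASE
    ),
    "total": re.compile(
        r"Total\s*:?\s*\$?\s*([\d,]+\.\d{2})", re.IGNORECASE
    ),
}


def extract_fields(scraped_text, patterns=FIELD_PATTERNS):
    """Return only the fields whose pattern matches; all other text is ignored."""
    result = {}
    for name, pattern in patterns.items():
        match = pattern.search(scraped_text)
        if match:
            # group(1) is the value captured right after the label
            result[name] = match.group(1)
    return result


# Made-up sample standing in for text scraped from a PDF, junk included.
sample = """ACME Corp  %%junk%%
Invoice No: INV-1042
some unrelated line
Total: $1,250.00
footer junk"""

print(extract_fields(sample))
```

The idea is to scrape everything and let the patterns decide what to keep, so junk in the scraped text does no harm as long as the labels you anchor on are stable.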
