Welcome folks today in this blog post we will be
extracting tables as
df from online pdf document url in command line. All the full source code of the application is shown below.
In order to get started you need to install the below libraries using the
pip command as shown below
pip install pandas
After this just make an
app.py file and copy paste the following code
from tabula import read_pdf
url = "URL OF THE PDF FILE"
df = read_pdf(url)
except Exception as e:
As you can see we are importing the
tabula library from that we are using the
read_pdf() and here you need to replace the
url of the pdf file. And then we are using the
read_pdf() method and passing the url of the pdf file.