用pyPdf打开pdf网址
发布于 2021-01-29 18:24:32
如何从网址而不是从磁盘打开pdf
就像是
input1 = PdfFileReader(file("http://example.com/a.pdf", "rb"))
我想从网上打开几个文件,然后下载所有文件的合并文件。
关注者
0
被浏览
45
1 个回答
-
我认为urllib2将为您提供所需的东西。
from urllib2 import Request, urlopen from pyPdf import PdfFileWriter, PdfFileReader from StringIO import StringIO url = "http://www.silicontao.com/ProgrammingGuide/other/beejnet.pdf" writer = PdfFileWriter() remoteFile = urlopen(Request(url)).read() memoryFile = StringIO(remoteFile) pdfFile = PdfFileReader(memoryFile) for pageNum in xrange(pdfFile.getNumPages()): currentPage = pdfFile.getPage(pageNum) #currentPage.mergePage(watermark.getPage(0)) writer.addPage(currentPage) outputStream = open("output.pdf","wb") writer.write(outputStream) outputStream.close()