satData.py 文件源码

python
阅读 26 收藏 0 点赞 0 评论 0

项目:FA-IR_Ranking 作者: MilkaLichtblau 项目源码 文件源码
def __loadSATPDF(self, filename):
        print("loading SAT score pdf")
        """
        loads the SAT PDF file, deletes all nonsense and creates an array containing only the numbers
        from the table

        Return
        ------
        All numbers from the SAT table in a string array
        """
        pdf = pypdf.PdfFileReader(open(filename, "rb"))
        tableContents = []

        for page in pdf.pages:
            content = page.extractText()
            tableHeader = "Total \nMale Female \nScore \nNumber Percentile Number Percentile Number Percentile "
            tableFooter = "De˜nitions of statistical terms are provided online at research."
            tableContents += self.__getTableContent(content, tableHeader, tableFooter)
            if "Number" and  "Mean" and "S.D." in tableContents:
                tableContents = tableContents[:tableContents.index("S.D.") - 2]

        return tableContents
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号