mining.py 文件源码

python

阅读 31 收藏 0 点赞 0 评论 0

项目：CSCE482-WordcloudPlus 作者: ggaytan00 项目源码文件源码

def scrape(site_address):
    page = requests.get(site_address)           #returns raw html
    page = clean_html(page.content) #removes <script> tags and their contents
    document = html.document_fromstring(page)   #removes all other tags

    return document.text_content()

# takes a url as a string and returns a STRING of all of the words
# that are used on that webpage

评论列表正在加载评论...

文章目录

提
问题

写
面经

写
文章

微信
公众号

扫码关注公众号