Python3 Selenium+Chr

Selenium Chr 2023-01-31 01:01:15 275人浏览薄情痞子

Python 官方文档：入门教程 => 点击学习

摘要

以前抓取动态网页是用Phantomjs + selenium + ChromeDriver，但是新版的Selenium不支持PhantomJS了，程序跑的时候总会跳出一些warnings.现在的操作是放弃PhantomJS，直接用head

以前抓取动态网页是用Phantomjs + selenium + ChromeDriver，但是新版的Selenium不支持PhantomJS了，程序跑的时候总会跳出一些warnings.

现在的操作是放弃PhantomJS，直接用headless ChromeDriver。可直接在Google主页下载个ChromeDriver，都是支持headless的。

下面的程序就是启动driver，抓取数据，关闭driver的例子。记得要关掉driver，不然会占内存。

# -*- coding: UTF-8 -*-
'''
@version: python 3.6
@introduction:
@author: 
@date: 2018-3
'''

from selenium import WEBdriver
from selenium.webdriver.chrome.options import Options

# 启动driver
def init_web_driver():
    global driver
    chrome_options = Options()
    chrome_options.add_argument('--headless')
    chrome_options.add_argument('--disable-gpu')
    driver_path = 'E:\chromedriver.exe'    #这里放的就是下载的driver本地路径
    driver = webdriver.Chrome(chrome_options=chrome_options, executable_path = driver_path)

# 关掉driver
def close_web_driver():
    driver.quit()

def get_data():
    driver.get('https://www.baidu.com')
  driver.implicitly_wait(10)  # wait up to 10 seconds for the elements to become available
    # ======  网页中静态部分抓取，采用BeautifulSoup去解析   
    html = driver.page_source    # 获取网页html
    html_soup = BeautifulSoup(html.text,"lxml")
    time.sleep(0.1)
    coin_list = html_soup .find(name='table', attrs={"class": "table maintable"})
    # 页面元素的提取请查看 BeautifulSoup的用法
    # ====== 网页中动态部分抓取，采用driver自带的方法
    # 下面展示的从调用百度搜索，在搜索框中输入"headless chrome"，然后获取结果。具体自行百度driver的用法
    text = driver.find_element_by_CSS_selector('#kw')
    search = driver.find_element_by_css_selector('#su')
    text.send_keys('headless chrome')
    # search
    search.click()
    driver.get_screenshot_as_file('search-result.png')
    results = driver.find_elements_by_xpath('//div[@class="result c-container "]')
    for result in results:
        res = result.find_element_by_css_selector('a')
        title = res.text
        link = res.get_attribute('href')
        print ('Title: %s \nLink: %s\n' % (title, link) )


if __name__ == '__main__':
    init_web_driver()
    get_data()
    close_web_driver()

您可能感兴趣的文档:

--结束END--

本文标题: Python3 Selenium+Chr

本文链接: https://lsjlt.com/news/184560.html(转载时请注明来源链接)

有问题或投稿请发送至: 邮箱/279061341@qq.com QQ/279061341

猜你喜欢

Python3 Selenium+Chr

以前抓取动态网页是用PhantomJS + Selenium + ChromeDriver，但是新版的Selenium不支持PhantomJS了，程序跑的时候总会跳出一些warnings.现在的操作是放弃PhantomJS，直接用head...

99+

2023-01-31

Selenium Chr
python3 selenium + f

一、分析：抓取动态页面js加载的人民日报里面的新闻详情 https://wap.peopleapp.com/news/1先打开，然后查看网页源码，发现是一堆js，并没有具体的每个新闻的url详情，于是第一反应，肯定是js动态加载拼接的u...

99+

2023-01-31

selenium
python3爬虫-通过selenium

from selenium import webdriver from selenium.common.exceptions import NoSuchElementException from selenium.webdriver.c...

99+

2023-01-31

爬虫 selenium
通过Python3+selenium自动

使用的是selenium，最开始接触这个模块包是爬虫，这次是工作需要来自动测试网页。记录一下模拟不同浏览器的方式总共测了两个浏览器，Firefox在centos7上和chrome在win10上。都是模拟的IPAD方式访问，原因是使用...

99+

2023-01-31

selenium
python3+selenium访问网页

selenium的安装直接通过pip install selenium就可以安装了，非常简单。使用Firefox时报错：selenium.common.exceptions.WebDriverException: Message: '...

99+

2023-01-31

网页 selenium
python3+selenium框架设计

使用HTMLTestRunner可以生成测试报告。HTMLTestRunner是unittest模块下的一个拓展，原生的生成报告样式比较丑，GitHub上有大佬优化过后的版本：GitHub地址。下载之后解压应该是这样的我们需要使用的是...

99+

2023-01-31

框架 selenium
Python3爬虫利器:Selenium怎么安装

小编给大家分享一下Python3爬虫利器:Selenium怎么安装，希望大家阅读完这篇文章之后都有所收获，下面让我们一起去探讨吧！python主要应用领域有哪些1、云计算，典型应用OpenStack。2、WEB前端开发，众多大型网站均为Py...

99+

2023-06-14
【python】chr与ord函数的使用

ord()是将已知字母转换成其顺序值；chr()是将已知字母的顺序至转换成其对应的字母>>> ord("a") 97 >>> ord("A") 65 >>>...

99+

2023-01-31

函数 python chr
VBS中CHR码值对应什么

这篇文章给大家分享的是有关VBS中CHR码值对应什么的内容。小编觉得挺实用的，因此分享给大家做个参考，一起跟随小编过来看看吧。Chr(0) 为0的...

99+

2023-06-09
ASP/VBScript中CHR(0)的示例分析

这篇文章给大家分享的是有关ASP/VBScript中CHR(0)的示例分析的内容。小编觉得挺实用的，因此分享给大家做个参考，一起跟随小编过来看看吧。该字符标识着字符串的结束，也称作null-terminated，这个给脚本编程尤其是ASP编...

99+

2023-06-08
如何在VBS中使用Chr 函数

如何在VBS中使用Chr 函数？很多新手对此不是很清楚，为了帮助大家解决这个难题，下面小编将为大家详细讲解，有这方面需求的人可以来学习下，希望你能有所收获。Chr 函数返回与指定的 ANSI 字符代码相对应的字符。Chr(charcode)...

99+

2023-06-08
利用selenium 3.7和python3添加cookie模拟登陆的实现

前言随着Python3的普及，Selenium3也跟上了行程。而Selenium3最大的变化是去掉了Selenium RC，另外就是Webdriver从各自浏览器中脱离，必须单独下载。本文就来介绍了关于s...

99+

2022-06-04

selenium cookie
oracle中chr函数的用法是什么

在Oracle中，CHR()函数用于返回指定ASCII代码所对应的字符。它的语法如下： CHR(ascii_code) 其中，as...

99+

2024-04-09

oracle
oracle中chr函数的作用是什么

在 Oracle 数据库中，CHR 函数用于返回指定 ASCII 码对应的字符。CHR 函数的语法如下： CHR(n) 其中 n 是...

99+

2024-04-09

oracle
Python中的chr()函数与ord()函数解析

目录chr()函数与ord()函数解析chr()函数ord()函数应用:凯撒密码的加密和解码ord()函数与chr()函数的区别chr()函数与or&#...

99+

2024-04-02
python + selenium +

使用python3.6在Ubuntu中进行了一项使用Chrome headless浏览器的工作, 在此记录下遇到的问题以及解决方法. 入门参考 unning-selenium-with-headless-chrome Ubuntu中如何...

99+

2023-01-31

python selenium
Selenium + chromeDr

在使用 selenium + chromeDriver + python3 截图时，遇上 Flash 无法加载，导致了截图 Falsh 是空白区。环境要求：selenium chromeDriver Python3 问题 chrome ...

99+

2023-01-31

Selenium chromeDr
Python函数介绍：chr函数的功能和示例

Python函数介绍：chr函数的功能和示例概述：在Python中，chr函数是内置函数之一。它接受一个整数参数，并返回对应的 Unicode 字符。功能：chr函数的主要功能是根据给定的 Unicode 码点返回对应的字符。Unicode...

99+

2023-11-04

关键词：Python函数表示Unicode编码值返回该编码对应的字符。
Selenium爬虫

第01节 Selenium 1. Selenium概述 Selenium是一个Web的自动化测试工具，最初是为网站自动化测试而开发的，Selenium 可以直接运行在浏览器上，它支持所有主流的浏览器。因为Selenium可以控制浏览器发...

99+

2023-09-18

python chrome 爬虫
selenium+webDriver+h

一、环境搭建工欲善其事，必先利其器。在这里，我们采用selenium+webDriver+headless Chrome(当然，这里使用FireFox、Safari浏览器都可以)来实现爬虫。 (一)工具 1.selenium,一个用于W...

99+

2023-01-30

selenium webDriver