爬虫助手WebScraper中文网

使用“Element click”以后,无法抓取标签里面的相关信息

[复制链接]
发表于 2022-3-5 01:20:20 | 显示全部楼层 |阅读模式

马上注册,结交更多好友,享用更多功能

您需要 登录 才可以下载或查看,没有帐号?立即注册

x
在selector的container中使用Element click

container

container


但无法抓取到标签里面的相关信息

2

2


以下是网页标签内,需要抓取的信息:

交易历史

交易历史


实在找不出是哪里出了问题,还请大神赐教!!!

测试网页:https://www.okx.com/metax/nft/ma ... fdae77987ade9270e/2


Web Scraper中文网 - 用户指南

① 首先下载插件,可以参考《Web Scraper插件版本归档》。
② 安装插件,可以参考《如何在谷歌Chrome浏览器上安装Web Scraper插件》。
③ 插件的使用教程,参考《Web Scraper插件使用教程
※ 遇到问题,鼓励先自行解决或网友互助,在《Web Scraper插件网友互助》求助。


回复

使用道具 举报

发表于 2022-3-6 23:13:10 | 显示全部楼层

看下结构

看下结构



你忽略了一个问题。你是一步到位这么做的。


jiage-tab这一步里面,选择容器比较重要。需要F12 控制台看一下元素。

我这里举例以价格为例。

以div.nft-detail-tab为父集的容器,因为这个已经把click后显示的表格数据包含在内(必须把后面需要提取的内容的元素包起来)

s1.jpg


你参考下我的地图。

  1. {"_id":"okx","startUrl":["https://www.okx.com/metax/nft/markets/0x6d37b607613c5d103576d56fdae77987ade9270e/2"],"selectors":[{"clickElementSelector":"div.okui-tabs-pane:nth-of-type(2)","clickElementUniquenessType":"uniqueHTMLText","clickType":"clickOnce","delay":2000,"discardInitialElements":"do-not-discard","id":"jiage-tab","multiple":true,"parentSelectors":["_root"],"selector":"div.nft-detail-tab","type":"SelectorElementClick"},{"delay":0,"id":"jiage","multiple":false,"parentSelectors":["jiage-tab"],"regex":"","selector":".markets-detail-table-price span","type":"SelectorText"}]}
复制代码



希望对你有帮助,以后类似需要点击才能显示的数据,如何抓取,有了眉目。
回复

使用道具 举报

 楼主| 发表于 2022-3-8 00:56:11 | 显示全部楼层
webscraper 发表于 2022-3-6 23:13
你忽略了一个问题。你是一步到位这么做的。

谢谢指点!!!参考您的代码已经弄清楚了你所讲的原理,非常感谢!
回复

使用道具 举报

 楼主| 发表于 2022-3-8 00:59:25 | 显示全部楼层
你的代码中,我还有一处不太明白,还请赐教!

就是container中的“Multiple”为什么不选中,就抓取不到所需要的数据呢?不是click一次就可以点开我们需要的“交易历史”页面了么?
回复

使用道具 举报

 楼主| 发表于 2022-3-8 01:03:49 | 显示全部楼层
另外还有一个问题请教一下,所在页面里面有一个“复制按钮”,是否能抓取其所复制的内容?

复制.png
回复

使用道具 举报

 楼主| 发表于 2022-3-8 02:03:18 | 显示全部楼层
我自己整理了一份整个页面的信息抓取,结果出现每个页面有2条抓取数据,而且其中1条数据的“container-record”部分数据为null. 不知道是哪里搞错了,还请赐教!!!
111.png

以下是我的Web Scraper代码:
  1. {"_id":"linglong-test","startUrl":["https://www.okx.com/metax/nft/markets/0x6d37b607613c5d103576d56fdae77987ade9270e/[3492-3495]"],"selectors":[{"clickElementSelector":"div.okui-tabs-pane:nth-of-type(2)","clickElementUniquenessType":"uniqueHTMLText","clickType":"clickOnce","delay":1000,"discardInitialElements":"do-not-discard","id":"container-record","multiple":true,"parentSelectors":["container"],"selector":"div.nft-detail-tab","type":"SelectorElementClick"},{"delay":0,"id":"事件 ","multiple":false,"parentSelectors":["container-record"],"regex":"","selector":"div.markets-detail-table-row:nth-of-type(2) div.markets-detail-table-cell:nth-of-type(1)","type":"SelectorText"},{"delay":0,"id":"价格 ","multiple":false,"parentSelectors":["container-record"],"regex":"","selector":"div.markets-detail-table-row:nth-of-type(2) .markets-detail-table-price span","type":"SelectorText"},{"delay":0,"id":"折合美金","multiple":false,"parentSelectors":["container-record"],"regex":"[0-9]+","selector":"div.markets-detail-table-row:nth-of-type(2) p.markets-detail-table-gray","type":"SelectorText"},{"delay":0,"id":"卖家地址","multiple":false,"parentSelectors":["container-record"],"regex":"","selector":"div.markets-detail-table-row:nth-of-type(2) p:nth-of-type(1) span.markets-detail-table-address","type":"SelectorText"},{"delay":0,"id":"买家地址","multiple":false,"parentSelectors":["container-record"],"regex":"","selector":"div.markets-detail-table-row:nth-of-type(2) p:nth-of-type(2) span.markets-detail-table-address","type":"SelectorText"},{"delay":0,"id":"时间 ","multiple":false,"parentSelectors":["container-record"],"regex":"","selector":"div.markets-detail-table-row:nth-of-type(2) p.markets-detail-table-address-date","type":"SelectorText"},{"delay":0,"id":"container","multiple":false,"parentSelectors":["_root"],"selector":"div.markets-detail-main","type":"SelectorElement"},{"delay":0,"id":"title","multiple":false,"parentSelectors":["container"],"regex":"","selector":"p.baseInfo-name","type":"SelectorText"},{"delay":0,"id":"状态 ","multiple":false,"parentSelectors":["container"],"regex":"","selector":"p.nft-price-info-price-text","type":"SelectorText"},{"delay":0,"id":"主页价格","multiple":false,"parentSelectors":["container"],"regex":"","selector":"span.nft-price-info-price-number","type":"SelectorText"},{"delay":0,"id":"价格折合美金","multiple":false,"parentSelectors":["container"],"regex":"[0-9]+","selector":"span.nft-price-info-usd-price","type":"SelectorText"},{"delay":0,"id":"token ID","multiple":false,"parentSelectors":["container"],"regex":"","selector":"div:nth-of-type(3) div.blockchain-info-item-value","type":"SelectorText"},{"delay":0,"id":"稀缺度","multiple":false,"parentSelectors":["container"],"regex":"","selector":"div.blockchain-info-item:nth-of-type(5) div.blockchain-info-item-value","type":"SelectorText"},{"delay":0,"id":"拥有者","multiple":false,"parentSelectors":["container"],"regex":"","selector":"div.user-item-hover","type":"SelectorText"}]}
复制代码


回复

使用道具 举报

发表于 2022-3-8 17:48:19 | 显示全部楼层
seduse 发表于 2022-3-8 02:03
我自己整理了一份整个页面的信息抓取,结果出现每个页面有2条抓取数据,而且其中1条数据的“container-reco ...

晚点我也学习下
回复

使用道具 举报

发表于 2022-3-8 22:40:56 | 显示全部楼层
seduse 发表于 2022-3-8 02:03
我自己整理了一份整个页面的信息抓取,结果出现每个页面有2条抓取数据,而且其中1条数据的“container-reco ...

看截图是重复了一次。你把container-record 的Multiple取消勾选,Click element uniqueness选择unique text再试试。
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则 需要先绑定手机号

QQ|Archiver|手机版|网站地图|爬虫助手WebScraper中文网 ( 渝ICP备18015624号-16 )

GMT+8, 2024-9-19 09:31 , Processed in 0.093663 second(s), 20 queries .

Powered by Discuz! X3.4

Copyright © 2001-2021, Tencent Cloud.

快速回复 返回顶部 返回列表