This post was last edited by hj170520 on 2020-11-20 22:14
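A question for the experts here:
"https://www.cbirc.gov.cn/branch/beijing/view/pages/common/ItemList.html?itemPId=1851&itemId=1855&itemUrl=ItemListRightList.html&itemName=%E8%A1%8C%E6%94%BF%E5%A4%84%E7%BD%9A#2"
I want to scrape some of the "penalty information" from this site, but what the site returns seems to be nothing but "JS" code.
I can't get at the page's real source at all!
What is going on here?
Below is the source my crawler fetched. It is nothing like what the site shows in the browser!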
[HTML]
<!DOCTYPE html>
<html lang="zh-cn">
<head>
<meta charset="UTF-8">
<title>中国银行保险监督管理委员会</title>
<meta name="author" content="">
<meta name="description" content="">
<meta name="keywords" content="">
<meta name="SiteName" content="">
<meta name="SiteDomain" content="">
<meta name="SiteIDCode" content="">
<meta name="ColumnName" content="">
<meta name="ColumnDescription" content="">
<meta name="ColumnKeywords" content="">
<meta name="ColumnType" content="">
<meta http-equiv="Window-target" content="_top">
<link rel="Shortcut Icon" href="favicon.ico">
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<meta name="viewport" content="width=device-width, initial-scale=1">
<link href="/branch/css/common/base.css?v=20200108" rel="stylesheet" />
<link href="/branch/css/common/Common.css?v=20200108" rel="stylesheet" />
<!--[if lt IE 9]>
<script src="/branch/js/common/html5shiv.min.js"></script>
<script src="/branch/js/common/respond.js"></script>
<![endif]-->
</head>
<body>
<div class="main ng-cloak" ng-app="myApp">
<tpl src="/branch/view/components/Header.html"></tpl>
<div class="content" ng-controller="itemListCtrl">
<link href="/branch/css/common/share.css?v=20200108" rel="stylesheet" />
<!-- <div class="breadcrumb">
<ul>
<li>当前位置:
<span id="currentLocation"></span>
</li>
</ul>
</div> -->
<div class="breadcrumb">
<ul>
<li>当前位置:
<a ng-href="{{breadcrumb_shouye}}">首页</a>
</li>
<li ng-repeat="x in breadcrumb_detail">
<a>{{x.itemName}} </a>
</li>
</ul>
</div>
<div class="main">
<div class="row container">
<div class="caidan-left-div">
<tpl src="/branch/view/pages/ItemListSide.html"></tpl>
</div>
<div class="caidan-right-div">
<tpl id="itemList"></tpl>
</div>
</div>
</div>
</div>
<tpl src="/branch/view/components/Footer.html"></tpl>
</div>
<script src="/branch/js/common/jquery/jquery-1.11.2.min.js"></script>
<script>
function queryParam(name) {
var reg = new RegExp("(^|&)" + name + "=([^&]*)(&|$)");
var r = window.location.search.substr(1).match(reg);
if (r != null) return unescape(r[2]); return null;
};
$("#itemList").attr("src", "/branch/view/pages/common/" + queryParam("itemUrl"));
</script>
<script src="/branch/js/common/angular.1.2.32.min.js"></script>
<script src="/branch/js/common/jquery.share.min.js"></script>
<script src="/branch/js/common/Script.js?v=20200108"></script>
<script src="/branch/js/common/Nav.js?v=20200108"></script>
<script src="/branch/js/common/ItemList.js?v=20200108"></script>
<script type="text/javascript">
$(document).ready(function () {
$('#share-more').myHoverTip('share-more-all');
$('#share-weixin').share({ sites: ['wechat'] });
$('#share-weibo').share({ sites: ['weibo'] });
$('#share-qzone').share({ sites: ['qzone'] });
$('#share-qq').share({ sites: ['qq'] });
})
</script>
<!--custom-->
</body>
</html>
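A note on reading the dump above: the fetched HTML is an AngularJS shell (`ng-app="myApp"`, `{{x.itemName}}`, `<tpl>` tags), so the visible content is filled in by the scripts only after the page loads in a browser, and a plain HTTP fetch sees just the placeholders. The inline script builds the path of the list template from the `itemUrl` query parameter. A minimal Python sketch of that same step follows. It only mirrors the inline `queryParam` logic and makes no network requests. The function names `template_url` and `item_params` are made up for illustration, and the real data endpoint (presumably requested by `ItemList.js`) is not shown in the dump, so it would still have to be found in the browser's Network tab.

```python
from urllib.parse import urlsplit, parse_qs, urljoin

# The page URL from the post, split across lines only for readability.
PAGE_URL = (
    "https://www.cbirc.gov.cn/branch/beijing/view/pages/common/ItemList.html"
    "?itemPId=1851&itemId=1855&itemUrl=ItemListRightList.html"
    "&itemName=%E8%A1%8C%E6%94%BF%E5%A4%84%E7%BD%9A#2"
)


def item_params(page_url: str) -> dict:
    """Return the query parameters of the page URL as a flat dict (hypothetical helper)."""
    query = parse_qs(urlsplit(page_url).query)
    # parse_qs returns a list per key; each key appears once here, so take the first value.
    return {key: values[0] for key, values in query.items()}


def template_url(page_url: str) -> str:
    """Mirror the inline script: "/branch/view/pages/common/" + queryParam("itemUrl")."""
    item_url = item_params(page_url)["itemUrl"]
    # urljoin resolves the absolute path against the page's scheme and host.
    return urljoin(page_url, "/branch/view/pages/common/" + item_url)


if __name__ == "__main__":
    print(template_url(PAGE_URL))
    print(item_params(PAGE_URL))
```

The values returned by `item_params` (`itemPId`, `itemId`, `itemName`) are the ones the page scripts have available, so they are likely candidates for whatever request actually returns the penalty list, though that is an assumption until confirmed in the Network tab.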