pathon小白实践第四天，爬音乐

乐于助人 发表于 2019-7-30 13:14

第四天了，爬个音乐，嘿嘿，不多说，上源码，然后我要去学scrapy框架了，等我学好了，回来继续给大家分享。:lol:lol:lol
import requests
import re
from lxml import etree
import os
class Spyder():
def headers(self):
   headers={
                        'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.110 Safari/537.36 SE 2.X MetaSr 1.0'
            }
   self.first_request(headers)
def first_request(self,headers):
   url = 'http://music.taihe.com/artist'
   response = requests.get(url,headers=headers)
   html = etree.HTML(response.content.decode())
   art_name_list = html.xpath('//dt[@class="cover-img"]/a/img/@title')
   art_link_list = html.xpath('//dt[@class="cover-img"]/a/@href')
   for art_name,art_link in zip(art_name_list,art_link_list):
         if os.path.exists(art_name) == False: #如果当前没有Bigtit,就创建一个
            os.mkdir(art_name)

         self.second_request(art_name,art_link,headers)
def second_request(self,art_name,art_link,headers):
   response = requests.get('http://music.taihe.com'+art_link,headers=headers)
   html1 = response.content.decode()
   html = etree.HTML(response.content.decode())
   song_name_list = html.xpath('//span[@class="songname"]/a/@title')
   song_link_list =re.compile('<a href="/song/(.*?)" class="songlist-songname namelink overdd" ').findall(html1)

   for song_name,song_link in zip(song_name_list,song_link_list):
         self.load_music(song_name,song_link,art_name,headers)

def load_music(self,song_name,song_link,art_name,headers):
   url= 'http://musicapi.taihe.com/v1/restserver/ting?method=baidu.ting.song.playAAC&format=jsonp&callback=jQuery17209000847668843108_1563975246089&songid='+song_link
   response = requests.get(url,headers=headers).content.decode()

   link =''.join(re.compile('{"show_link":"(.*?)",').findall(response)).replace('\\','')
   self.data_request(link,song_name,art_name,headers)
   # print("正在下载的歌曲是：".song_name)
def data_request(self,link,song_name,art_name,headers):
   file_name =art_name +'\\' +song_name +'.mp3'
   print('正在下载的歌曲是：',song_name,'作者：',art_name)
   response = requests.get(link,headers=headers).content
   with open(file_name,'wb') as f:
         f.write(response)

spyder=Spyder()
spyder.headers()

WangChun518 发表于 2019-7-30 14:20

lu_ 发表于 2019-7-30 14:10
https://www.52pojie.cn/forum.php?mod=redirect&goto=findpost&ptid=998477&pid=27148245
太大了，我选 ...

好吧谢谢{:301_986:}

lu_ 发表于 2019-7-30 14:10

WangChun518 发表于 2019-7-30 13:37
楼主你是学的哪个教程

https://www.52pojie.cn/forum.php?mod=redirect&goto=findpost&ptid=998477&pid=27148245
太大了，我选择放弃，从入门学起{:1_907:}

妖狠站得稳 发表于 2019-7-30 13:28

谢谢分享

yutianll 发表于 2019-7-30 13:29

这么厉害的吗，4天就学会了{:1_921:}

水鸟发表于 2019-7-30 13:30

谢谢分享

WangChun518 发表于 2019-7-30 13:37

楼主你是学的哪个教程{:301_1001:}

smartkey 发表于 2019-7-30 13:40

楼主厉害，刚开始就比较牛

yt1010306 发表于 2019-7-30 13:45

楼主应该懂其他语言。

Kum 发表于 2019-7-30 13:49

哈哈哈哈楼主你把标题打错了

隰则有泮 发表于 2019-7-30 14:10

看着奇怪的编程风格

页: [1] 2 3 4

吾爱破解 - 52pojie.cn's Archiver

pathon小白实践第四天，爬音乐