用Python检索我的世界mod是否可用于服务器（通常用于整合包转服务端）

pengGgxp 发表于 2023-12-7 16:39

缘由是一开始有朋友找我开个服务器玩我的世界，想玩整合包，我在网上搜了很久，发现没有现成的服务端，只能自己做，但是又碍于有的mod不支持服务端，直接CV是不行了，所以就去找办法，最后想到可以用百科中的环境配置进行查看。然后就憋出了下面这些东西：
import os
import re
import time

import requests
from lxml import etree

def iterate_path(path):
'''
遍历我的世界mod文件夹，将可能的mod列表返回（因为现在的mod名称识别是机器识别，后面我想加入语言识别，单词通顺才加入mod列表并返回）
:param path:mod路径
:return:mod名称列表，供爬虫搜索
'''

# 遍历目录
List_dir = os.listdir(path)
# 处理前缀问题，如果前缀相同，先把前缀删去
re_mod = re.compile(r'[^_\-(（]+')
mod_List = []
mod_List2 = []
chongfu = []
# 首先全部遍历一次
for i in List_dir:
   tmp1 = re_mod.search(i)
   if tmp1:
         mod_List2.append(tmp1.group())
   else:
         mod_List2.append(i[:-4])
# 统计每个元素的个数，如果大于1，就加入重复前缀列表
for i in range(len(mod_List2)):
   tmp = mod_List2.count(mod_List2)
   if tmp > 1 and mod_List2 not in chongfu:
         chongfu.append(mod_List2)
# 再次遍历，将最后的mod名称列表返回
for i in List_dir:
   tmp1 = re_mod.findall(i)
   if tmp1 not in chongfu and tmp1 != None:
         mod_List.append(tmp1)
   elif tmp1 in chongfu:
         mod_List.append("".join(tmp1[:2]))
   else:
         mod_List.append(i[:-4])
# print(mod_List)
return mod_List

def crawl(List_mod):
'''
通过mod列表进行网站检索，并把运行环境写入result.txt文件，供用户使用
:param List_mod: mod名称列表
:return:
'''

# 请求头设置
header = {
   "Cookie": "1111111111111111",
   "User-Agent": "1111111111"
}

# 遍历mod名称列表，进行爬取
for key in List_mod:
   # xpath路径外层搜索xpath
   xpath1 = "/html/body/div/div/div/div/div/div/div/div/a//@href"
   # 确定url
   url = f"https://search.mcmod.cn/s?key={key}&site=&filter=1&mold=0"
   # 进行尝试，并设置重试次数
   ci = 1
   while True:
         res = requests.request("get", url=url, headers=header)
         xhtml = etree.HTML(res.text)
         mod_href = xhtml.xpath(xpath1)
         if mod_href == []:
            if ci > 5:
               with open("result.txt", "a+", encoding='utf-8') as file:
                     file.write(key + " 获取失败\n")
               print(key + " 获取失败\n")
               break
            print(f'获取失败了，正在重试这个链接{ci}/5:' + url)
            ci += 1
            time.sleep(2)
            continue
         else:
            break
   if ci > 5:
         continue
   print('获取链接成功 ' + str(mod_href) + " ", end='')
   # 内层xpath
   xpath2 = "/html/body/div/div/div/div/div/div/div/div/ul/li//text()"

   xpath3 = "/html/body/div/div/div/div/div/div//text()"
   #查找环境设置并写入文件
   res = requests.request("get", url=mod_href, headers=header)
   xhtml = etree.HTML(res.text)
   ispel_txt = key + " 搜索到的MOD：" + str("|".join(xhtml.xpath(xpath3))) + " " + str(xhtml.xpath(xpath2))
   with open("result.txt", "a+", encoding='utf-8') as file:
         file.write(ispel_txt + "\n")
   print(ispel_txt)

   time.sleep(3)

if __name__ == "__main__":
# 获取mod列表
path = "F:\Minecraft\server\mods"
List_mod = iterate_path(path)
# 进行mod检索用的是https://www.mcmod.cn/
List_Unavailable_mod = crawl(List_mod)

这个识别是有点问题的，所以可能检索出来的结果有差别，但是大部分是可以的，那些小部分的可以自己再去检索一下。

yy103050 发表于 2023-12-7 22:10

有点酷，这代码挺全的，直接能用

1024A1024 发表于 2023-12-7 23:03

有没有steam游戏 mod ID查询工具，根据游戏查id的？

pengGgxp 发表于 2023-12-7 23:49

1024A1024 发表于 2023-12-7 23:03
有没有steam游戏 mod ID查询工具，根据游戏查id的？

能说清楚点吗？有点不理解

lizy169 发表于 2023-12-8 06:36

思路阔以，学习一下

页: [1]

吾爱破解 - 52pojie.cn's Archiver

用Python检索我的世界mod是否可用于服务器（通常用于整合包转服务端）