https://www.g2b.go.kr:8101/ep/preparation/orderplan/orderplanPubList.do
ȸ鿡¼ °Ë»ö¾î¸¦ ³ÖÀº °á°ú¸¦ Å©·Ñ¸µ ÇÏ´Â °ÍÀÌ ¸ñÇ¥ÀÔ´Ï´Ù. Python À» ÀÌ¿ëÇÕ´Ï´Ù.
±×·±µ¥.. Å©·Ñ¸µÀ» ÇÒ¶§, °Ë»ö¾î¸¦ Çѱ۷Π³ÖÀ¸¸é °Ë»öÀÌ ¾ÈµË´Ï´Ù
±×µµ ±×·²°ÍÀÌ?
Å©·Ò °³¹ßÀÚ µµ±¸¸¦ º¸¸é ¾Æ·¡ ó·³ ³ª¿É´Ï´Ù.
Form Data ¸¦ º¸¸é, ¾Æ·¡ ó·³ º¸ÀÔ´Ï´Ù.
voTaskClCd=&userCl=&popupYn=&popId=&popTaskTypeCd=&fromOrderEra=202112&toOrderEra=202201&taskTypeCd=&taskClCd=&taskTypeCd0=&taskTypeCd1=&taskTypeCd3=&taskTypeCd2=&taskTypeCd4=&fromOrderEraYear=2021&fromOrderEraMonth=12&toOrderEraYear=2022&toOrderEraMonth=01&ppsWay=&fromReleaseDt=2021%2F10%2F22&toReleaseDt=2021%2F12%2F21&instNm=&instCd=&instAddr=&industry1=&industry2=&bizNm=%BA%F1%B4%EB%B8%E9&totalRecordCount=26954&downloadRange=1&recordCountPerPage=10
ÀÌ ºÎºÐÀÌ µµÀúÈ÷ ÇØ°áÀÌ ¾ÈµÇ³×¿ä.
¹°·Ð, Selenium À¸·Î ±¸ÇöÇÏ¸é ¹®Á¦ ¾øÀÌ µÇ°ÚÀ¸³ª, ±× ¹æ¹ý ¸»°í background ¿¡¼ Å©·Ñ¸µÀ» ÇÏ°í ½Í½À´Ï´Ù.
¼Ò½º´Â ¹¹.. ±×¾ß ¸»·Î °£´ÜÇÏÁÒ
--------------------------------------------
# -*- coding: utf-8 -*-
from bs4.element import SoupStrainer
import requests
from bs4 import BeautifulSoup
from lxml import etree
import time
from datetime import datetime, timedelta
def doSearch(searchWord) :
headers = {
'Referer': 'https://www.g2b.go.kr:8101/ep/preparation/orderplan/orderplanPubList.do'
}
params = dict()
params["voTaskClCd"] = "1"
params["userCl"] = ""
params["popupYn"] = ""
params["popId"] = ""
params["popTaskTypeCd"] = ""
params["fromOrderEra"] = "202112"
params["toOrderEra"] = "202201"
params["taskTypeCd"] = ""
params["taskClCd"] = "1"
params["taskTypeCd0"] = ""
params["taskTypeCd1"] = ""
params["taskTypeCd3"] = ""
params["taskTypeCd2"] = ""
params["taskTypeCd4"] = ""
params["fromOrderEraYear"] = "2021"
params["fromOrderEraMonth"] = "12"
params["toOrderEraYear"] = "2022"
params["toOrderEraMonth"] = "01"
params["ppsWay"] = ""
params["fromReleaseDt"] = "2021/10/21"
params["toReleaseDt"] = "2021/12/20"
params["instNm"] = ""
params["instCd"] = ""
params["instAddr"] = ""
params["detailPrdnmNo"] = ""
params["industry1"] = ""
params["industry2"] = ""
params["bizNm"] = searchWord #"" # <-------------------------- ¿©±â°¡ ¹®Á¦ "ºñ´ë¸é" À¸·Î °Ë»öÇÏ¸é °Ë»ö°á°ú ¾È³ª¿È
params["totalRecordCount"] = "11"
params["recordCountPerPage"] = "10"
# print(params)
res = requests.post('https://www.g2b.go.kr:8101/ep/preparation/orderplan/orderplanPubList.do', data=params, headers=headers)
soup = BeautifulSoup(res.text, 'html.parser')
print('soup=', soup)
doSearch('ºñ´ë¸é') |
¿©±â¼ EUC-KR·Î µðÄÚµù ÇØ º¸¸é "ºñ´ë¸é"À¸·Î ³ª¿À³×¿ä.
euc-kr ÀÌ ÈùÆ®¿´¾ú³×¿ä.