logo

[r-crawling] 사례분석: 국토부 표준지공시지가 크롤링 (1편)

광역자치단체 코드 가져오기

library(httr)
res = GET('https://www.kais.kr/realtyprice/bjd/searchBjd.bjd?gubun=&addr_gbn=&sido=&sgg=&eub=&year=2015')
json = content(res)
sido = matrix(unlist(json$model$list), ncol=5, byrow=T)

기초자치단체 코드 가져오기

sgg.url = 'https://www.kais.kr/realtyprice/bjd/searchBjd.bjd?gubun=sgg&addr_gbn=&sido=11&sgg=&eub=&year=2015&sido_list=11'
sgg.mat = matrix(c(0,0,0,0,0), ncol=5)
for(code in sido[,5]){
  url = modify_url(sgg.url, query=list(sido=code, sido_list=code))
  res = GET(url)
  json = content(res)
  sgg = matrix(unlist(json$model$list), ncol=5, byrow=T)
  sgg.mat = rbind(sgg.mat, sgg)
}
sgg.mat = sgg.mat[-1,]

읍면동 코드 가져오기

eub.url = 'https://www.kais.kr/realtyprice/bjd/searchBjd.bjd?gubun=eub&addr_gbn=&sido=11&sgg=11680&eub=&year=2015&sido_list=11&sgg_list=11680'
res = GET(eub.url)
json = content(res)
matrix(unlist(json$model$list), ncol=4, byrow=T)
Previous
Selenium 사용 사례