[R] Problem with download.file ?

Giles Crane gilescrane at verizon.net
Fri Mar 27 15:25:56 CET 2015


# download.file() Seems to put the xlsx file onto hard drive.

>download.file("http://www.udel.edu/johnmack/data_library/zipcode_centroids.xlsx", "zipcode_centroids.xlsx")
trying URL 'http://www.udel.edu/johnmack/data_library/zipcode_centroids.xlsx'
Content type 'application/vnd.openxmlformats-officedocument.spreadsheetml.sheet' length 2785832 bytes (2.7 Mb)
opened URL
downloaded 2.7 Mb


# Trouble reading file with xlsx.

library(xlsx)
Loading required package: rJava
Loading required package: xlsxjars
Warning messages:
1: package ‘xlsx’ was built under R version 3.1.3
2: package ‘rJava’ was built under R version 3.1.3

>df <- read.xlsx2("zipcode_centroids.xlsx", sheetIndex=1)
Error in .jcall("RJavaTools", "Ljava/lang/Object;", "invokeMethod", cl,  :
   java.util.zip.ZipException: invalid entry size (expected 1168 but got 1173 bytes)


# I downloaded the file manually (same name) from the web page and tried again.
# Then I read the file into R with xlsx successfully.


>df <- read.xlsx2("/zipdist/zipcode_centroids.xlsx", sheetIndex=1)
>str(df)
'data.frame':	42961 obs. of  8 variables:
  $ ZIPCODE  : Factor w/ 42961 levels "01001","01002",..: 1 2 3 4 5 6 7 8 9 10 ...
  $ TOWN.    : Factor w/ 18955 levels "Aaronsburg","Abbeville",..: 85 333 333 333 898 1089 1459 1620 1899 2929 ...
  $ STATE    : Factor w/ 52 levels "AK","AL","AR",..: 21 21 21 21 21 21 21 21 21 21 ...
  $ LATITUDE : Factor w/ 37352 levels "-7.209975","19.101978",..: 28020 28948 28916 28971 29047 28624 28326 28418 28197 28603 ...
  $ LONGITUDE: Factor w/ 37241 levels "-100.00991","-100.02632",..: 8799 8706 8811 8715 8470 8639 9019 8608 8531 9065 ...
  $ STFIPS   : Factor w/ 51 levels "01","02","04",..: 22 22 22 22 22 22 22 22 22 22 ...
  $ CD       : Factor w/ 55 levels "00","01","02",..: 3 2 2 2 2 2 2 3 3 2 ...
  $ CONG_DIST: Factor w/ 436 levels "01_01","01_02",..: 191 190 190 190 190 190 190 191 191 190 ...

# Is there a problem with download.file() when file is an Excel file or this particular Excel file?

-- 
Giles L Crane, MPH, ASA, NJPHA
Statistical Consultant and R Instructor
621 Lake Drive
Princeton, NJ  08540
Phone: 609 924-0971
Email: gilescrane at verizon.net


	[[alternative HTML version deleted]]



More information about the R-help mailing list