[Rd] Inconsistencies of Fundamental Summary Functions

Dario Strbenac dstr7320 at uni.sydney.edu.au
Fri Jan 8 06:00:04 CET 2016


Good day,

Functions such as min and max work on a data frame of numbers, but other functions, like median do not. For example,

database <- data.frame(Name = LETTERS[1:3], Amount = 1:3, Price = 4:6)
> min(database[, 2:3])
[1] 1
> mean(database[, 2:3])
[1] NA
> median(database[, 2:3])
Error in median.default(database[, 2:3]) : need numeric data

This observation was made when teaching university students in the first year of their degrees.  Some functions require coercion by using as.matrix but others don't. Surely these simple commands should be consistent.

Also, the documentation for min and max is incomplete. It states "‘Numeric’ arguments are vectors of type integer and numeric, and logical." It should also mention that it works on data frames, although not when data frames are represented as lists, which can easily be coerced back to data frames.

> min(as.list(database[, 2:3]))
Error in min(as.list(database[, 2:3])) : 
  invalid 'type' (list) of argument

--------------------------------------
Dario Strbenac
PhD Student
University of Sydney
Camperdown NSW 2050
Australia


More information about the R-devel mailing list