[R] Why do I have a column called row.names?

Rui Barradas ruipbarradas at sapo.pt
Mon Jun 4 21:01:41 CEST 2012


Hello,

It must be something else. Mandatory row names have nothing to do with a 
column header.
I've put the data example in a tab separated file and the strange 
behavior was not reproduced.

 > read.delim("test.txt", row.names=NULL, fill=TRUE)
     start     stop           Symbol Insert.sequence Clone.End.Pair FISH
1  203048 67173930 ABC8-43024000D23    TI:993812543   TI:993834585   NA
2  255176 87869359 ABC8-43034700N15    TI:995224581   TI:995237913   NA
3 1022033  1060472    ABC27-1253C21   TI:2094436044  TI:2094696079   NA
4 1022033  1061172     ABC23-1388A1   TI:2120730727  TI:2121592459   NA


With read.table, I tried with and without header=TRUE. No "success".

Rui Barradas

Em 04-06-2012 19:30, Bert Gunter escreveu:
> Actually, I think it's ?data.frame that he should read.
>
> The salient points are that:
> 1. All data frames must have unique row names. If not provided, they
> are produced. Row numbers **are** row names.
>
> 2. The return value of read methods are data frames.
>
> -- Bert
>
> On Mon, Jun 4, 2012 at 11:05 AM, David L Carlson<dcarlson at tamu.edu>  wrote:
>> Try help("read.delim") - always a good strategy before using a function for
>> the first time:
>>
>> In it, you will find: "Using row.names = NULL forces row numbering. Missing
>> or NULL row.names generate row names that are considered to be 'automatic'
>> (and not preserved by as.matrix)."
>>
>> ----------------------------------------------
>> David L Carlson
>> Associate Professor of Anthropology
>> Texas A&M University
>> College Station, TX 77843-4352
>>
>>
>>> -----Original Message-----
>>> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-
>>> project.org] On Behalf Of Ed Siefker
>>> Sent: Monday, June 04, 2012 12:47 PM
>>> To: r-help at r-project.org
>>> Subject: [R] Why do I have a column called row.names?
>>>
>>> I'm trying to read in a tab separated table with read.delim().
>>> I don't particularly care what the row names are.
>>> My data file looks like this:
>>>
>>>
>>> start   stop    Symbol  Insert sequence Clone End Pair  FISH
>>> 203048  67173930        ABC8-43024000D23                TI:993812543
>>>   TI:993834585
>>> 255176  87869359        ABC8-43034700N15                TI:995224581
>>>   TI:995237913
>>> 1022033 1060472 ABC27-1253C21           TI:2094436044   TI:2094696079
>>> 1022033 1061172 ABC23-1388A1            TI:2120730727   TI:2121592459
>>>
>>>
>>>
>>> I have to do something with row.names because my first column has
>>> duplicate entries.  So I read in the file like this:
>>>
>>>> BACS<-read.delim("testdata.txt", row.names=NULL, fill=TRUE)
>>>> head(BACS)
>>>    row.names    start             stop Symbol Insert.sequence
>>> Clone.End.Pair
>>> 1    203048 67173930 ABC8-43024000D23     NA    TI:993812543
>>> TI:993834585
>>> 2    255176 87869359 ABC8-43034700N15     NA    TI:995224581
>>> TI:995237913
>>> 3   1022033  1060472    ABC27-1253C21     NA   TI:2094436044
>>> TI:2094696079
>>> 4   1022033  1061172     ABC23-1388A1     NA   TI:2120730727
>>> TI:2121592459
>>>    FISH
>>> 1   NA
>>> 2   NA
>>> 3   NA
>>> 4   NA
>>>
>>>
>>> Why is there a column named "row.names"?  I've tried a few different
>>> ways of invoking this, but I always get the first column named
>>> row.names,
>>> and the rest of the columns shifted by one.
>>>
>>> Obviously I could fix this by using row.names<-, but I'd like to
>>> understand
>>> why this happens.  Any insight?
>>>
>>> ______________________________________________
>>> R-help at r-project.org mailing list
>>> https://stat.ethz.ch/mailman/listinfo/r-help
>>> PLEASE do read the posting guide http://www.R-project.org/posting-
>>> guide.html
>>> and provide commented, minimal, self-contained, reproducible code.
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
>



More information about the R-help mailing list