[R] issue with numeric

Ana Marija @okov|c@@n@m@r|j@ @end|ng |rom gm@||@com
Wed Dec 18 21:44:24 CET 2019


Hi Ivan,

here it is:

> str(a)
'data.frame':    17389 obs. of  21 variables:
 $ V1             : Factor w/ 17389 levels "ENSG00000000419",..: 14093
14622 14705 14651 14784 17138 14773 14163 14569 15156 ...
 $ V2             : Factor w/ 22 levels "chr1","chr10",..: 1 1 1 1 1 1
1 1 1 1 ...
 $ V3             : int  29571 36082 91106 133724 173863 200323 259025
297503 348367 493242 ...
 $ V4             : int  29570 36081 91105 133723 173862 200322 259024
297502 348366 493241 ...
 $ V5             : Factor w/ 2 levels "-","+": 1 1 1 1 1 1 1 1 1 1 ...
 $ V6             : int  983 1006 1169 1340 1441 1620 1897 2032 2175 2697 ...
 $ V7             : int  -828479 -38709 -782443 69986 -831895 -980529
-647609 -946918 -631093 -886444 ...
 $ V8             : Factor w/ 17104 levels "1:10095977:G:GT",..: 7339
4761 2344 7480 12580 4781 14856 3061 9397 6938 ...
 $ V9             : Factor w/ 22 levels "chr1","chr10",..: 1 1 1 1 1 1
1 1 1 1 ...
 $ V10            : int  858049 74790 873548 63735 1005757 1180851
906633 1244420 979459 1379685 ...
 $ V11            : int  858049 74790 873548 63738 1005771 1180851
906633 1244420 979459 1379685 ...
 $ Effect_allele  : Factor w/ 358 levels "A","AAAAACAAAAC",..: 267 190
92 92 54 92 190 1 267 267 ...
 $ Baseline_allele: Factor w/ 435 levels "A","AAAAAAAAAATAAAAAT",..:
112 112 325 175 1 325 325 237 112 237 ...
 $ V12            : int  404 404 404 404 404 404 404 404 404 404 ...
 $ V13            : num  348 347 347 339 342 ...
 $ V14            : num  1.04 1.03 1.02 1.04 1.04 ...
 $ V15            : num  140 138 153 154 171 ...
 $ V16            : num  0.000742 0.002822 0.000626 0.000798 0.002894 ...
 $ V17            : num  0.269 -0.687 -0.285 -0.398 0.341 ...
 $ V18            : num  0.196 0.531 0.203 0.265 0.639 ...
 $ V19            : num  0.198 0.529 0.199 0.261 0.634 ...

and this:

> which(is.na(as.numeric(as.character(a[,18]))))
 [1] 10757 11062 11063 11064 11065 11066 11067 11068 11069 11070 11071 11072
[13] 11073 11074 11075
> which(is.na(as.numeric(as.character(a[,19]))))
 [1] 10757 11062 11063 11064 11065 11066 11067 11068 11069 11070 11071 11072
[13] 11073 11074 11075

columns 18 and 19 seems to be numeric, what is could be the issue?

On Wed, Dec 18, 2019 at 1:49 PM Ivan Krylov <krylov.r00t using gmail.com> wrote:
>
> On Wed, 18 Dec 2019 12:25:24 -0600
> Ana Marija <sokovic.anamarija using gmail.com> wrote:
>
> > Error in cor(D[, 18 + exon_offset], D[, 19 + exon_offset]) :
> >   'x' must be numeric
>
> Try str(a) to find out the types of the columns. A stray typo could
> make a representation of a number impossible to parse and make the
> whole column textual. Use
> which(is.na(as.numeric(as.character(a[,column_number])))) to find out
> the row number where it happened (using extra as.character() here in
> case the column is a factor).
>
> --
> Best regards,
> Ivan



More information about the R-help mailing list