[R] misbehavior with extract_numeric() from tidyr

arnaud gaboury arnaud.gaboury at gmail.com
Mon Apr 20 20:57:20 CEST 2015


On Mon, Apr 20, 2015 at 6:09 PM, William Dunlap <wdunlap at tibco.com> wrote:

> The hyphen without a following digit confuses tidyr::extract_numeric().
> E.g.,
>    > extract_numeric("23 ft-lbs")
>    Warning message:
>    In extract_numeric("23 ft-lbs") : NAs introduced by coercion
>    [1] NA
>    > extract_numeric("23 ft*lbs")
>    [1] 23
>

See[0] for the reason on the minus in the regex. It is not a bug but a wish.
I am honestly very surprised the maintainer decided to go with such a so
simple solution for negative numbers.

[0]https://github.com/hadley/tidyr/issues/20

Contact the BugReports address for the package
>    > packageDescription("tidyr")$BugReports
>    [1] "https://github.com/hadley/tidyr/issues"
> or package's maintainer
>    > maintainer("tidyr")
>    [1] "Hadley Wickham <hadley at rstudio.com>"
> to report problems in a user-contributed package.
>
>
>
> Bill Dunlap
> TIBCO Software
> wdunlap tibco.com
>
> On Mon, Apr 20, 2015 at 12:10 AM, arnaud gaboury <arnaud.gaboury at gmail.com
> > wrote:
>
>> R 3.2.0 on Linux
>> --------------------------------
>>
>> library(tidyr)
>>
>> playerStats <- c("LVL 10", "5,671,448 AP l6,000,000 AP", "Unique
>> Portals Visited 1,038",
>> "XM Collected 15,327,123 XM", "Hacks 14,268", "Resonators Deployed
>> 11,126",
>> "Links Created 1,744", "Control Fields Created 294", "Mind Units
>> Captured 2,995,484 MUs",
>> "Longest Link Ever Created 75 km", "Largest Control Field 189,731 MUs",
>> "XM Recharged 3,006,364 XM", "Portals Captured 1,204", "Unique Portals
>> Captured 486",
>> "Resonators Destroyed 12,481", "Portals Neutralized 1,240", "Enemy
>> Links Destroyed 3,169",
>> "Enemy Control Fields Destroyed 1,394", "Distance Walked 230 km",
>> "Max Time Portal Held 240 days", "Max Time Link Maintained 15 days",
>> "Max Link Length x Days 276 km-days", "Max Time Field Held 4days",
>> "Largest Field MUs x Days 83,226 MU-days")
>>
>>
>> -----------------------------------------------------------------------------------------------
>>  extract_numeric(playerStats)
>>  [1]             10 56714486000000           1038       15327123
>>    14268          11126           1744            294        2995484
>> [10]             75         189731        3006364           1204
>>      486          12481           1240           3169           1394
>> [19]            230            240             15             NA
>>        4             NA
>>
>>
>> ------------------------------------------------------------------------------------------------
>>  playerStats[c(22,24)]
>> [1] "Max Link Length x Days 276 km-days"      "Largest Field MUs x
>> Days 83,226 MU-days"
>>
>> --------------------------------------------------------------------------------------------
>>
>> I do not understand why these two vectors return NA when the function
>> extract_numeric() works well for others,
>>
>> Any wrong settings in my env?
>>
>> Thank you for hints.
>>
>>
>>
>> --
>>
>> google.com/+arnaudgabourygabx
>>
>> ______________________________________________
>> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
>


-- 

google.com/+arnaudgabourygabx
<https://plus.google.com/_/notifications/emlink?emr=05814804238976922326&emid=CKiv-v6PvboCFcfoQgod6msAAA&path=%2F116159236040461325607%2Fop%2Fu&dt=1383086841306&ub=50>

	[[alternative HTML version deleted]]



More information about the R-help mailing list