[R] parsing DOB data

Pär Leijonhufvud par.leijonhufvud using regionjh.se
Thu Apr 16 19:01:18 CEST 2020


For more solutions, see https://stackoverflow.com/questions/33221603/r-lubridate-returns-unwanted-century-when-given-two-digit-year

The proposed solution there:
   library(lubridate)
   some_dates <- c("3/18/75", "March 10, 1994", "10/1/80", "June 15, 1979")
   dates <- mdy(some_dates)
   # Flag dates whose parsed year lies in the future ...
   future_dates <- year(dates) > year(Sys.Date())
   # ... and move them back one century
   year(dates[future_dates]) <- year(dates[future_dates]) - 100

This should work for your case with one adaptation: change mdy() to ymd(). At least it worked on your example.
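
Applied to the three dates from the original question, the adaptation could look like the sketch below. It assumes lubridate is installed and that no date of birth can lie in the future; the helper name fix_century and its today argument are made up for this illustration and are not part of lubridate.

```r
library(lubridate)

# Hypothetical helper (not part of lubridate): parse year-month-day strings
# and move any date whose year lies after the reference year back by a century.
fix_century <- function(x, today = Sys.Date()) {
  dates <- ymd(x)
  future <- !is.na(dates) & year(dates) > year(today)
  if (any(future)) {
    year(dates[future]) <- year(dates[future]) - 100
  }
  dates
}

x <- c("45-12-03", "01-06-24", "64-9-15")
dob <- fix_century(x)
print(dob)
```

One limitation of this approach: a two-digit year that is not in the future stays in 20xx, so someone born in 1919 and recorded as "19-05-01" would still come out as 2019.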



Pär Leijonhufvud, par.leijonhufvud using regionjh.se
Hospital chemist, +46(0)63-153 376, +46-(0)70-242 7006
Östersunds sjukhus

-----Original Message-----
From: R-help <r-help-bounces using r-project.org> On Behalf Of Peter Nelson via R-help
Sent: 15 April 2020 20:31
To: r-help using r-project.org
Subject: [R] parsing DOB data

I have a data set (.csv) with date (e.g. date of birth) information stored as character vectors that I'm attempting to transform to POSIXct objects using the package lubridate (1.7.4). The problem that I'm trying to address is that my two-digit years are invariably (?) parsed to 20xx. For example,

x <- c("45-12-03","01-06-24","64-9-15")
[1] "2045-12-03" "2001-06-24" "2064-09-15"

These should be parsed as “1945-12-03” “2001-06-24” “1964-09-15”.

I've tried to use parse_date_time(). Based on the documentation, it looks to me as though the argument cutoff_2000 should allow me to address this, but it's unclear to me how to implement it. As an example, I've tried

parse_date_time(x, cutoff_2000 = 01)

but get the following error message (and similar for other similar attempts, including cutoff_2000 = 01L)

Error in parse_date_time(x, cutoff_2000 = 1) :
  unused argument (cutoff_2000 = 1)
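
A note on the error above: in lubridate's documentation, cutoff_2000 is listed as an argument of parse_date_time2() and fast_strptime(), which share a help page with parse_date_time(); parse_date_time() itself rejects it here, as the message reports. A minimal sketch of the parse_date_time2() route follows. The orders string "ymd" and the cutoff value 20 are assumptions chosen for this example, not something stated in the thread.

```r
library(lubridate)

x <- c("45-12-03", "01-06-24", "64-9-15")

# cutoff_2000 = 20L: two-digit years up to and including 20 are read as 20xx,
# larger ones as 19xx. The value 20 is an assumption made for this example.
parsed <- parse_date_time2(x, orders = "ymd", cutoff_2000 = 20L)

# parse_date_time2() returns date-times in UTC; drop the time part
dob <- as.Date(parsed)
print(dob)
```

The cutoff has to be picked to suit the data: it should be at least the two-digit form of the latest birth year that can occur and below that of the earliest one.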

Thanks for your help!

Peter Nelson, PhD
Institute of Marine Sciences
University of California, Santa Cruz
Center for Ocean Health, Long Marine Lab
115 McAllister Way
Santa Cruz, CA, 95076, USA
