[R] Calculate mileage at each 'faildate'

Abhinaba Roy abhinabaroy09 at gmail.com
Wed Aug 10 11:22:53 CEST 2016


Hi R-helpers,


   - I have a dataframe similar to 'simD'.

> dput(simD)
structure(list(ID = structure(c(1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L,
2L, 2L), .Label = c("A", "B"), class = "factor"), PRODDATE =
structure(c(14655,
14655, 14655, 14655, 14664, 14664, 14664, 14664, 14664, 14664
), class = "Date"), FAILDATE = structure(c(15053, 15054, 15057,
15058, 14844, 14875, 14876, 14905, 14936, 14966), class = "Date"),
    MILEAGE = c(21548L, 22578L, 22868L, 23654L, 30245L, 32148L,
    34128L, 35879L, 39874L, 40125L)), .Names = c("ID", "PRODDATE",
"FAILDATE", "MILEAGE"), row.names = c(NA, -10L), class = "data.frame")


   - I have split the dataset by 'ID' and sorted the dataframe by
   'faildate' (oldest to newest).



   - Now, for each of the splits I want to calculate the 'Mileage' at each
   of the 'faildate'.



   - The output I desire is 'outD'.


> dput(outD)
structure(list(ID = structure(1:2, .Label = c("A", "B"), class = "factor"),
    PRODDATE = structure(c(14655, 14664), class = "Date"), MIL_1 =
c(21548L,
    30245L), MIL_2 = c(1030L, 1903L), MIL_3 = c(290L, 1980L),
    MIL_4 = c(786L, 1751L), MIL_5 = c(NA, 3995L), MIL_6 = c(NA,
    251L)), .Names = c("ID", "PRODDATE", "MIL_1", "MIL_2", "MIL_3",
"MIL_4", "MIL_5", "MIL_6"), row.names = c(NA, -2L), class = "data.frame")

***Please note that I have MIL_6 because the max(# failures) in data by ID
is 6 (6 failures for 'B')*


   - And I would like to extend it to other numeric and date variable as
   well.


How can I do this in R?

Best,
Abhinaba

	[[alternative HTML version deleted]]



More information about the R-help mailing list