[R] "Denormalize" data

RobinLovelace rob00x at hotmail.com
Tue Aug 9 14:36:44 CEST 2011


Hello R users, 

My problem is that the data I've got is in the minimum number of columns
with each ward (geographic area) appearing multiple times. The first 30
terms look like this

> HHum02
         CASW  Btype   Yr CO2Group NumVeh
170597 00CCFA   CARS 2002        C      2
170598 00CCFA   CARS 2002        D      2
170599 00CCFA   CARS 2002        E     22
170600 00CCFA   CARS 2002        F     32
170601 00CCFA   CARS 2002        G     32
170602 00CCFA   CARS 2002        H     12
170603 00CCFA   CARS 2002        I     12
170604 00CCFA   CARS 2002        J      9
170605 00CCFA   CARS 2002     K(L)      8
170606 00CCFA   CARS 2002     K(M)      2
170607 00CCFA   CARS 2002        K      9
170608 00CCFA     AG 2002 non-cars      2
170609 00CCFA     BS 2002 non-cars      2
170610 00CCFA GHEAVY 2002 non-cars      9
170611 00CCFA GLIGHT 2002 non-cars     23
170612 00CCFA  MOTOS 2002 non-cars     24
170613 00CCFA OTHERS 2002 non-cars      6
170787 00CCFB   CARS 2002        D      1
170788 00CCFB   CARS 2002        E     11
170789 00CCFB   CARS 2002        F     12
170790 00CCFB   CARS 2002        G     20
170791 00CCFB   CARS 2002        H     17
170792 00CCFB   CARS 2002        I      4
170793 00CCFB   CARS 2002        J     10
170794 00CCFB   CARS 2002     K(L)      2
170795 00CCFB   CARS 2002     K(M)      1
170796 00CCFB   CARS 2002        K      5
170797 00CCFB GHEAVY 2002 non-cars      6
170798 00CCFB GLIGHT 2002 non-cars      4
170799 00CCFB  MOTOS 2002 non-cars     25

But what I need is for there to be only 1 row for each ward (e.g. 00CCFA).
This would mean adding extra columns and would look like this:

"CASW"	" C"	" D"	" E"	" F"	" G"	" H"	" I"	" J"	" K(L)"	" K(M)"	" K"
"non-cars"
"00CCFA"	2	2	22	32	32	12	12	9	8	2	9	66
"01CCFB"	0	1	11	12	20	17	4	10	2	1	5	35

I know R has the capability to do this, but people in my department only
know how to do this using STATA. I've explored various options. unstack()
seems to be the most appropriate but it's just not working using the default
formula: 

http://r.789695.n4.nabble.com/file/n3729817/Screenshot-*getting-started.txt_%28%7E-1Projects-OSS_general%29_-_gedit.png   

Look forward to learning, 

Robin


--
View this message in context: http://r.789695.n4.nabble.com/Denormalize-data-tp3729817p3729817.html
Sent from the R help mailing list archive at Nabble.com.



More information about the R-help mailing list