[R] elimination of multiple rows

John Day jday at csihq.com
Thu Jul 18 15:00:48 CEST 2002


Christian,

Sounds like you want something that can run as a preprocessing step from the 
command line. If so (and if you're on Unix) you can use 'uniq', which 
removes adjacent duplicate lines but otherwise leaves the file in its 
original order:

 > uniq myfile > umyfile
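
To make that concrete, here is a minimal sketch that runs uniq on the seven
rows from the question. The temporary directory and the file contents are
assumptions for illustration (the "(n)" row labels are left out, since they
are not part of the file):

```shell
# Assumption: plain byte-order collation, so results are reproducible.
export LC_ALL=C

# Hypothetical sample file holding the seven rows from the question.
tmpdir=$(mktemp -d)
printf '%s\n' 'a b c d' 'a b c d' 'A b c d' 'A b c d' 'A b c d' \
              'a b c d' 'a b c D' > "$tmpdir/myfile"

# uniq collapses each run of identical adjacent lines into one line.
# The comparison is case-sensitive, so "a b c d" and "A b c d" differ.
uniq "$tmpdir/myfile" > "$tmpdir/umyfile"
cat "$tmpdir/umyfile"
# prints:
#   a b c d
#   A b c d
#   a b c d
#   a b c D
```

Note that row (6) survives, because it is not adjacent to rows (1) and (2).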

If you really want to get rid of _all_ dupes then sort first:

 > sort myfile | uniq > umyfile
OR
 > sort -u myfile > umyfile
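
As a rough check that the two forms agree, the sketch below applies both to
the same seven rows. The sample file and the LC_ALL=C setting are
assumptions for illustration; in other locales the sort order may differ,
and in either case the original row order is lost:

```shell
# Assumption: plain byte-order collation, so results are reproducible.
export LC_ALL=C

# Hypothetical sample file holding the seven rows from the question.
tmpdir=$(mktemp -d)
printf '%s\n' 'a b c d' 'a b c d' 'A b c d' 'A b c d' 'A b c d' \
              'a b c d' 'a b c D' > "$tmpdir/myfile"

# Sorting makes all identical lines adjacent, so uniq can drop every dupe.
sort "$tmpdir/myfile" | uniq > "$tmpdir/umyfile1"

# sort -u does the same thing in one step.
sort -u "$tmpdir/myfile" > "$tmpdir/umyfile2"

cat "$tmpdir/umyfile2"
# prints (byte order: uppercase before lowercase):
#   A b c d
#   a b c D
#   a b c d
```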

BTW, 'uniq' has a nifty -c option which prefixes each line with the number 
of times it occurred and can be used to create a useful 'histogram', sorted 
by frequency. You must sort first so that identical lines are adjacent, and 
then sort again after the uniq to arrange the counts in descending order:

sort myfile | uniq -c | sort -r -n
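
Here is a minimal sketch of that pipeline on the seven rows from the
question. The sample file, the variable name hist, and the LC_ALL=C setting
are assumptions for illustration. The exact padding that uniq -c puts in
front of each count varies between implementations, and lines with equal
counts may come out in either order:

```shell
# Assumption: plain byte-order collation, so results are reproducible.
export LC_ALL=C

# Hypothetical sample file holding the seven rows from the question.
tmpdir=$(mktemp -d)
printf '%s\n' 'a b c d' 'a b c d' 'A b c d' 'A b c d' 'A b c d' \
              'a b c d' 'a b c D' > "$tmpdir/myfile"

# 1. sort       : bring identical lines together
# 2. uniq -c    : collapse each group and prefix it with its count
# 3. sort -r -n : order numerically by count, largest first
hist=$(sort "$tmpdir/myfile" | uniq -c | sort -r -n)
printf '%s\n' "$hist"
```

With this input, "a b c d" and "A b c d" each get a count of 3 and
"a b c D" gets a count of 1, so the last line of the output is the
singleton.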

HTH,

John Day
Staff Scientist
Computer Science Innovations
Melbourne, FL
http://www.csi.cc/~jday

At 08:32 AM 7/18/02 +0200, you wrote:
>Hi all,
>
>Excuse me for posting a question which may not be true R.
>
>I want to process a text file so that the resulting file contains only one
>of possibly multiple consecutive rows. Example (the row numbers do not
>belong to the file):
>
>(1) a b c d
>(2) a b c d
>(3) A b c d
>(4) A b c d
>(5) A b c d
>(6) a b c d
>(7) a b c D
>..
>
>resulting in:
>
>(1) a b c d
>(3) A b c d
>(6) a b c d
>(7) a b c D
>..
>
>(6) could be disposed of also by first sorting the original file.
>
>Does anybody have a script ready, preferably in Perl? I do not know Perl
>well enough to write it myself.
>
>Thanks for your help.
>--christian
>
>Dr.sc.math.Christian W. Hoffmann
>Mathematics and Statistical Computing
>Landscape Dynamics and Spatial Development
>Swiss Federal Research Institute WSL
>Zuercherstrasse 111
>CH-8903 Birmensdorf, Switzerland
>phone: ++41-1-739 22 77    fax: ++41-1-739 22 15
>e-mail: christian.hoffmann at wsl.ch
>www: http://www.wsl.ch/staff/christian.hoffmann/
>
>-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-.-
>r-help mailing list -- Read http://www.ci.tuwien.ac.at/~hornik/R/R-FAQ.html
>Send "info", "help", or "[un]subscribe"
>(in the "body", not the subject !)  To: r-help-request at stat.math.ethz.ch
>_._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._._ 
>
