[R] Read Unicode text (*.txt)

Olivier Crouzet o||v|er@crouzet @end|ng |rom un|v-n@nte@@|r
Tue Jul 2 11:32:43 CEST 2019


Hi Javad,

I could not make sense of the data structure associated with the csv
file as it is copied in your previous message. Would you mind sending a
link so one can download your csv file directly (or at least the
first few lines) so people can check the exact properties of your file?

Yours.
Olivier.


On Tue, 2 Jul 2019 07:56:07 +0430 javad bayat
<j.bayat194 using gmail.com> wrote:

> Dear all;
> I use your suggestion but I gave the same warning messages. I changed
> the file name (Data.csv).
> "
> d4<-read.csv("./Data.csv",sep=";",header=TRUE,encoding="UTF-16")
>      Warning messages:
>          1: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>            line 1 appears to contain embedded nulls
>           2: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>           line 2 appears to contain embedded nulls
>           3: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>           line 3 appears to contain embedded nulls
>           4: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>            line 4 appears to contain embedded nulls
>           5: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>             line 5 appears to contain embedded nulls
>           6: In read.table(file = file, header = header, sep = sep,
> quote = quote,  :
>            line 1 appears to contain embedded nulls
>           7: In scan(file = file, what = what, sep = sep, quote =
> quote, dec = dec,  :
>              embedded nul(s) found in input
> "
> 
> I opened the Data in notepad. This is the head of Data.csv. The
> columns have been separated by semicolons.
> "
> "INLET Time";"INLET ValueY";"TRATED WATER TANK Time";"TRATED WATER
> TANK ValueY"
> 10/28/2018;550.057861328125;10/28/2018;487.812530517578
> 10/28/2018 12:00:01 ق.ظ;550.057861328125;10/28/2018 12:00:01
> ق.ظ;487.812530517578
> 10/28/2018 12:00:02 ق.ظ;550.057861328125;10/28/2018 12:00:02
> ق.ظ;487.812530517578
> 10/28/2018 12:00:03 ق.ظ;550.057861328125;10/28/2018 12:00:03
> ق.ظ;487.812530517578
> 10/28/2018 12:00:04 ق.ظ;550.057861328125;10/28/2018 12:00:04
> ق.ظ;487.812530517578
> .
> .
> .
> "
> Thanks.
> 
> 
> 
> On Tue, Jul 2, 2019 at 6:14 AM Jeff Newmiller
> <jdnewmil using dcn.davis.ca.us> wrote:
> 
> > Don't be so US-centric, Abby... how do you know that javad's
> > version of Excel doesn't default to using semicolons?
> >
> > ?read.csv2
> >
> > On July 1, 2019 6:06:32 PM PDT, Abby Spurdle <spurdle.a using gmail.com>
> > wrote:
> > >> I am trying to read an excel CSV file (1.csv). When I read it as
> > >> csv
> > >file
> > >> in R, the R shows me the exact number of row. But it puts all
> > >> columns
> > >in
> > >> one column, while I have 3 or 4 columns in the data frame.
> > >> d4 = read.table("./4.csv",sep=";",header=TRUE)
> > >
> > >Firstly, I recommend against naming your file "1.csv".
> > >(Start with a letter not a number).
> > >
> > >Secondly, a CSV file should be separated by commas not semicolons.
> > >You can specify sep=",", however, it's probably easier to use the
> > >read.csv() function.
> > >
> > >Note that you should be able to open your file in a text editor to
> > >see the
> > >separators.
> > >
> > >> I dont know why in the "save as type" box Unicode text (*.txt)
> > >
> > >Other posters have suggested that you need to specify the encoding.
> > >Assuming that you create your CSV file correctly in Excel, I doubt
> > >that this is necessary, but I could be wrong...
> > >
> > >Your comment suggests that you have saved your document as "Unicode
> > >text".
> > >You need to tell Excel to save the file as a CSV file.
> > >(There should be a list of save options).
> > >
> > >Simply typing a file name with a .csv file extension is unlikely to
> > >produce
> > >the desired result.
> > >
> > >       [[alternative HTML version deleted]]
> > >
> > >______________________________________________
> > >R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> > >https://stat.ethz.ch/mailman/listinfo/r-help
> > >PLEASE do read the posting guide
> > >http://www.R-project.org/posting-guide.html
> > >and provide commented, minimal, self-contained, reproducible code.
> >
> > --
> > Sent from my phone. Please excuse my brevity.
> >
> 
> 
> -- 
> Best Regards
> Javad Bayat
> M.Sc. Environment Engineering
> Alternative Mail: bayat194 using yahoo.com
> 
> 	[[alternative HTML version deleted]]
> 
> ______________________________________________
> R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html and provide commented,
> minimal, self-contained, reproducible code.


-- 
  Olivier Crouzet, PhD
  /Maître de Conférences/
  @LLING - Laboratoire de Linguistique de Nantes
    UMR6310 CNRS / Université de Nantes
  /Guest Researcher/
  @UMCG (University Medical Center Groningen)
    ENT department
    Rijksuniversiteit Groningen



More information about the R-help mailing list