[R] Mailinglist

Jim Lemon drjimlemon @ending from gm@il@com
Sun Jan 6 10:12:46 CET 2019


Hi Rachel,
It looks to me as though the first thing you want to do is to get your
data, which you attach as images, into a data frame. If these are flat
files like CSV or TAB, you should be able to read them in with some
variant of the read.table function. If Excel, look at the various
Excel import packages. Then you can operate on the data frame by doing
things like tabulating Participant ID against the code for SMS or call
(which I assume are those 3000+ numbers). You can take the differences
in what look like POSIX time values between successive TRUE and FALSE
screen values to get the duration of screen activity and it looks like
participant activity is recorded at regular intervals. As Jeff
suggested, this is really just boring work figuring out how to extract
the events:

call_indices<-which(Probetype == xxxxxxCallLogProbe & ValueSpecified
== _id  & Valuedetailed ==3271)

using suitable logical statements and then tabulating them by
ParticipantID. If you know how to do that in SPSS, it won't be too
hard to translate the logical statements into R syntax as above. I may
have misunderstood the variable names, but I think the logic is clear.

Jim

On Sun, Jan 6, 2019 at 4:07 PM Rachel Thompson
<rachel.thompson using student.uva.nl> wrote:
>
> Hi Jim,
>
> Thank you for the clarification. Since I only work in SPSS and I am from Amsterdam I have had problems with specifying what I am trying to do in this specific program and also in clear English language.
>
> I think I want to indeed aggregate these events for each subject over the observation. But in this case several observations.
> 1. I want to have a summary of how many times a specific subject got called (CallLogProbe)
> 2. I want to have a summary of how many times a specific subject got a text message (SMS probe)
> 3. I want to have a summary of how many times a specific subject
> - Turned their screen on - True  (ScreenProbe)
> - Or did not turn their screen on - False (ScreenProbe)
> 4.  I want to have a summary of the activity level of a specific subject
> - Activity level - none (ActivityProbe)
> - Activity level- low     (ActivityProbe)
> - Activity level - High  (ActivityProbe)
>
> I want to do this for all the 36 subjects(Participants).
>
> In the end, I have to define percentages, so I am able to say...Subject 36 has low social interactions ( because they only got called and texted 500 times in total, while the average of all the participants is 10000 or something). I have to come up with the percentages myself and define cutoff points of what is considered low-medium-high, based on what the results of all the subjects are.
>
> I hope that I am as clear as possible .
>
>
> I feel as if I am on my way of understanding it, but since I do not clearly know, I am trying out a lot of different codes etc. and I do not know if I am doing the right thing. I indeed made a new data frame etc, but I still feel a bit lost. Do I need to make one per subject or per Probe etc..
>
>
> Thanks for your help. I hope that you can help me resolve this issue.
>
>
> Best,
>
>
> Rachel
>
>
>
>
>
>
> On Sat, Jan 5, 2019 at 9:03 PM Jim Lemon <drjimlemon using gmail.com> wrote:
>>
>> Hi Rachel,
>> I'll take a guess and assume that you are monitoring the mobile phones
>> of 36 people, adding an observation every time some specified change
>> of state is sensed on each device. I'll also assume that you are only
>> recording four types of measurement. It seems that you want to
>> aggregate these events for each subject over the interval or
>> observation (or over each day or something). I think you are going to
>> create a new data frame of these summaries from the one you have of
>> individual observations. Creating each summary doesn't look too hard,
>> but you will have to define more precisely what you want those
>> summaries to be. For instance, "I want the mean activity level for
>> each subject during the overall time that their mobile phone is
>> switched on", One you have clearly defined your goals, it probably
>> won't be too hard to get to them.
>>
>> Jim
>>
>> On Sun, Jan 6, 2019 at 5:39 AM Rachel Thompson
>> <rachel.thompson using student.uva.nl> wrote:
>> >
>> > Dear Mr/Mrs,
>> >
>> > This is my first time working in R studio.
>> > I have a database of 36 participants but it has 150600 entries.
>> > Column -         Column - Column            - Column
>> >
>> > Participant       Activityprobe - Activity Level  - High/low/none
>> >
>> > Participant       Screenprobe - screenon/off     -
>> >
>> > Participant       SMSprobe etc
>> >
>> > Participant       CallLogProbe etc.
>> >
>> > I need a code that helps me count the activity level of all the participants
>> > High activity level. No activity level and Low activity level.
>> > And to help me find out for every participant what the percentages are of
>> > all their high/no/low activity.
>> >
>> > For screenprobe I need to count how many times the participant turned their
>> > screen on and how many times they turned it off and the percentage of
>> > screen on/off.
>> >
>> > For callLog I need to count how many times each participant got called and
>> > the percentage.
>> >
>> > For SMS I need to count the number of SMS for each participant and their
>> > percentage.
>> >
>> > I also need to categorize the probes. So that my database shows all the
>> > activity levels first, organized by none/high/low and then all the
>> > screenprobes, organized by on and off etc...
>> >
>> > I hope that my description is clear and that you can maybe help me.
>> >
>> > Best,
>> >
>> > Rachel
>> >
>> >         [[alternative HTML version deleted]]
>> >
>> > ______________________________________________
>> > R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
>> > https://stat.ethz.ch/mailman/listinfo/r-help
>> > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> > and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list