[R] Temperature Prediction Model

Clint Bowman clint at ecy.wa.gov
Mon Oct 26 17:22:39 CET 2009


Aneeta,

My "gorilla and mouse" analogies were referring to the magnitude of 
the disturbance and also to its time signature.  Are you only 
interested in the large disturbance which is abrupt (the gorilla)? 
Or do you also want to be able to detect the more surreptitious 
attack which may be quite gradual (the mouse)?

You will want to define the magnitude (and perhaps the associated 
duration) of the smallest disturbance that would be important.  I 
would look at the entire data set to see what would be the 
likelihood of detecting such a change given the noise in the 
temperature data.  Or alternatively, use the global analysis to 
help define the minimum disturbance that could be detected.

Then see what can be done with just the first 7 days of data (or 
for matter the past 7 days regardless of when they occur).

I applaude your goal of looking at each sensor without referring to 
other nodes but I think I would develop the analysis by looking for 
anomalies in one sensor's data when compared with other sensors and 
then focusing on those periods to determine an approach for 
detecting a disturbance.

Because you are looking at 7 days, should we assume that you expect 
a day-of-week dependence?  If so, I'd be more comfortable if you 
used more than one week to develop it.

I fear that you've gotten me quite interested in this analysis, 
good luck.

Clint

-- 
Clint Bowman			INTERNET:	clint at ecy.wa.gov
Air Quality Modeler		INTERNET:	clint at math.utah.edu
Department of Ecology		VOICE:		(360) 407-6815
PO Box 47600			FAX:		(360) 407-7534
Olympia, WA 98504-7600

On Sun, 25 Oct 2009, Aneeta wrote:

>
> Thank you everyone for all the responses.
>
> Clint you are correct in assuming that the problem deals with sensors in a
> lab setup which can be assumed to be isolated from outside temperature
> changes. And, I am only dealing with temperature so the other parameters are
> not important.
>
> There will be no gorillas or mouses in the picture but rather some malicious
> attacker who would try to cause disturbances in the normal readings. That is
> why it is important to have an equation that defines 'normal behaviour'.
>
> The data-sets contain readings for multiple days. I want to take the first 7
> days for each node and establish a relationship between time(column 2) and
> temperature(column 4).
>
> My objective is not to model temperature variation throughout the year and
> take into consideration climatic changes. Rather, it is to define a model
> for the given data which happens to be temperature recorded by nodes. In a
> simple way we may look at it as a set of X(time) and Y(temperature) values
> where I am trying to define Y in terms of X.
>
> How should I approach this problem?
>
> Many Thanks,
> Aneeta
>
>
> Clint Bowman wrote:
>>
>> Aneeta,
>>
>> If I understand the figure at
>> <http://db.csail.mit.edu/labdata/labdata.html> this problem deals
>> with sensors in a lab that is probably isolated from outdoor
>> temperature changes.
>>
>> I assume the predictive model must detect when a "rampaging 800
>> pound gorilla" messes with a sensor.  Do we also have to detect the
>> pawing of a "micro-mouse" as well?
>>
>> The collected data also seem to have other parameters which would
>> be valuable--are you limited to just temperature?
>>
>> Clint
>>
>> --
>> Clint Bowman			INTERNET:	clint at ecy.wa.gov
>> Air Quality Modeler		INTERNET:	clint at math.utah.edu
>> Department of Ecology		VOICE:		(360) 407-6815
>> PO Box 47600			FAX:		(360) 407-7534
>> Olympia, WA 98504-7600
>>
>> On Thu, 22 Oct 2009, Thomas Adams wrote:
>>
>>> Aneeta,
>>>
>>> You will have to have a seasonal component built into your model, because
>>> the
>>> seasonal variation does matter, particularly -where- you are
>>> geographically
>>> (San Diego, Chicago, Denver, Miami are very different). Generally, there
>>> is a
>>> sinusoidal daily temperature variation, but frontal passages and
>>> thunderstorms, etc., can and will disrupt this nice pattern. You may have
>>> to
>>> tie this into temperature predictions from a mesoscale numerical weather
>>> prediction model. Otherwise, you will end up with lots of misses and
>>> false
>>> alarms…
>>>
>>> Regards,
>>> Tom
>>>
>>> Aneeta wrote:
>>>>  The data that I use has been collected by a sensor network deployed by
>>>>  Intel.
>>>>  You may take a look at the network at the following website
>>>>  http://db.csail.mit.edu/labdata/labdata.html
>>>>
>>>>  The main goal of my project is to simulate a physical layer attack on a
>>>>  sensor network and to detect such an attack. In order to detect an
>>>> attack
>>>>  I
>>>>  need to have a model that would define the normal behaviour. So the
>>>> actual
>>>>  variation of temperature throughout the year is not very important out
>>>>  here.
>>>>  I have a set of data for a period of 7 days which is assumed to be the
>>>>  correct behaviour and I need to build a model upon that data. I may
>>>> refine
>>>>  the model later on to take into account temperature variations
>>>> throughout
>>>>  the year.
>>>>
>>>>  Yes I am trying to build a model that will predict the temperature just
>>>> on
>>>>  the given time of the day so that I am able to compare it with the
>>>>  observed
>>>>  temperature and determine if there is any abnormality. Each node should
>>>>  have
>>>>  its own expectation model (i.e. there will be no correlation between
>>>> the
>>>>  readings of the different nodes).
>>>>
>>>>
>>>>  Steve Lianoglou-6 wrote:
>>>>
>>>>>  Hi,
>>>>>
>>>>>  On Oct 21, 2009, at 12:31 PM, Aneeta wrote:
>>>>>
>>>>>
>>>>>>  Greetings!
>>>>>>
>>>>>>  As part of my research project I am using R to study temperature
>>>> data
>>>>>>  collected by a network. Each node (observation point) records
>>>>>>  temperature of
>>>>>>  its surroundings throughout the day and generates a dataset. Using
>>>> the
>>>>>>  recorded datasets for the past 7 days I need to build a prediction
>>>>>>  model for
>>>>>>  each node that would enable it to check the observed data against
>>>> the
>>>>>>  predicted data. How can I derive an equation for temperature using
>>>> the
>>>>>>  datasets?
>>>>>>  The following is a subset of one of the datasets:-
>>>>>>
>>>>>>       Time              Temperature
>>>>>>
>>>>>>  07:00:17.369668   17.509
>>>>>>  07:03:17.465725   17.509
>>>>>>  07:04:17.597071   17.509
>>>>>>  07:05:17.330544   17.509
>>>>>>  07:10:47.838123   17.5482
>>>>>>  07:14:16.680696   17.5874
>>>>>>  07:16:46.67457     17.5972
>>>>>>  07:29:16.887654   17.7442
>>>>>>  07:29:46.705759   17.754
>>>>>>  07:32:17.131713   17.7932
>>>>>>  07:35:47.113953   17.8324
>>>>>>  07:36:17.194981   17.8324
>>>>>>  07:37:17.227013   17.852
>>>>>>  07:38:17.809174   17.8618
>>>>>>  07:38:48.00011     17.852
>>>>>>  07:39:17.124362   17.8618
>>>>>>  07:41:17.130624   17.8912
>>>>>>  07:41:46.966421   17.901
>>>>>>  07:43:47.524823   17.95
>>>>>>  07:44:47.430977   17.95
>>>>>>  07:45:16.813396   17.95
>>>>>>
>>>>>  I think you/we need much more information.
>>>>>
>>>>>  Are you really trying to build a model that predicts the temperature
>>>>>  just given the time of day?
>>>>>
>>>>>  Given that you're in NY, I'd say 12pm in August sure feels much
>>>>>  different than 12pm in February, no?
>>>>>
>>>>>  Or are you trying to predict what one sensor readout would be at a
>>>>>  particular time given readings from other sensors at the same time?
>>>>>
>>>>>  Or ... ?
>>>>>
>>>>>  -steve
>>>>>
>>>>>  --
>>>>>  Steve Lianoglou
>>>>>  Graduate Student: Computational Systems Biology
>>>>> |   Memorial Sloan-Kettering Cancer Center
>>>>> |   Weill Medical College of Cornell University
>>>>>  Contact Info: http://cbio.mskcc.org/~lianos/contact
>>>>>
>>>>>  ______________________________________________
>>>>>  R-help at r-project.org mailing list
>>>>>  https://stat.ethz.ch/mailman/listinfo/r-help
>>>>>  PLEASE do read the posting guide
>>>>>  http://www.R-project.org/posting-guide.html
>>>>>  and provide commented, minimal, self-contained, reproducible code.
>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide
>> http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>>
>>
>
>


More information about the R-help mailing list