[R] Normalize GEO Data

Ebert,Timothy Aaron tebert @end|ng |rom u||@edu
Mon Aug 15 17:48:50 CEST 2022


One approach for adjusted p-values is to divide by the "magic cut-off" value by the number of tests. Typically one decides significance if the p-value is lass than 0.05. So not it is 0.05/(number of tests). This can be too conservative, but sometimes there are not many options. If you have a multiple comparison procedure (like the mean difference between five treatments) then there are tests like Tukey's HSD test, and many others that try for more balance.

"They are all 1" is possible but unlikely that all p-values would equal 1 except for one vale of 0.636. Maybe some data was not copied correctly, or maybe the model was not built correctly, or data not read in correctly. Please check that the data the computer has in memory matches the data in your file

Tim

-----Original Message-----
From: R-help <r-help-bounces using r-project.org> On Behalf Of Anas Jamshed
Sent: Monday, August 15, 2022 11:34 AM
To: R-help Mailing List <r-help using r-project.org>
Subject: [R] Normalize GEO Data

[External Email]

I performed GEO2R analysis on a series dataset and I'm looking to find the up-regulated and down-regulated genes. I know that to find up-regulated and down-regulated genes, I should check logFC (Fold-change in log2 scale (generally)).Consider the value of 1 in log2 is 0. There is optimal cutoff but log2 > 1 indicates up-regulation and log2 < -1 indicates down-regulation of genes. Moreover, I should consider adj.p.val which is the adjusted p-value (corrected p-value dues multiple comparisons). Again there is no generally accepted cutoff but I should consider values < 0.05 which indicates the test is statistically significant.
But the problem is in this particular GSE series none of the adj.p.value is < 0.05 - they are all "1" and a "0.636". However, the logFC values are >1, but none of the samples have a condition of "p <0.05 & logFC > 1".
So, can it be said that in this case, I need to normalise my data to find out the DEGs, up-regulated and down-regulated genes in a series GEO file?

        [[alternative HTML version deleted]]

______________________________________________
R-help using r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://nam10.safelinks.protection.outlook.com/?url=https%3A%2F%2Fstat.ethz.ch%2Fmailman%2Flistinfo%2Fr-help&data=05%7C01%7Ctebert%40ufl.edu%7Ceee701d2e54a41e01f9808da7ed3c46b%7C0d4da0f84a314d76ace60a62331e1b84%7C0%7C0%7C637961745255003828%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=CSx3odPuL1fZhvyyHBuq2p24a%2Fi9V7qqIMC9UMp3yZ0%3D&reserved=0
PLEASE do read the posting guide https://nam10.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.r-project.org%2Fposting-guide.html&data=05%7C01%7Ctebert%40ufl.edu%7Ceee701d2e54a41e01f9808da7ed3c46b%7C0d4da0f84a314d76ace60a62331e1b84%7C0%7C0%7C637961745255003828%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=tlLb%2B9%2BlfexcLPRRpb%2FPvby5%2Bf3cC5jOWB8t5k6JdNU%3D&reserved=0
and provide commented, minimal, self-contained, reproducible code.



More information about the R-help mailing list