hey stats geeks...
Sep. 25th, 2011 05:49 pmLet's say you have a situation in which you're tempted to use a t-test. However, not all the values came from different subjects. One subject was read 4 times, one subject was read 2 times, and the other 3 reads are each from a different subject. (Don't ask me why this was done, not my experiment...) This so totally violates the assumption of samples being independent, right? What would you do? Take an average for each subject, and do a t-test on the averages for each subject?
no subject
Date: 2011-09-26 12:58 am (UTC)no subject
Date: 2011-09-26 01:40 am (UTC)Question 1: Are there are really only 5 subjects in the trial to begin with?
Let's label the subject that did 4 trials S4 and the one that did 2 trials S2. So basically S2 gave you TMI and S4 gave you way too much I.
Question 2: Suppose you didn't know that was the case. The lab techs secretary
decided to randomly select only one trial from each subject and hand
those results to you. Would you consider the t-test appropriate to
use here? If so, run your test for each possible input set the lab tech
secretary could have handed you. If you get a decent p value in all or
most cases, consider the matter worth looking into and dweeb at whoever
should be dweebed at to design a better followup and run it. Otherwise
reject your hypothesis and move on.
no subject
Date: 2011-09-26 02:29 am (UTC)If you had more people, and the 4 runs were "ordered" in some way, and knew, when a person had less than 4 runs which runs they did or did not have, *and* that the missing runs were "missing completely at random" or possibly "missing at random", and the data met a bunch of other assumptions, you might be able to move into the territory of "mixed effects modeling."
But barring that, f's approach would be good for poking around and exploring...
However.... I'm thinking from a hypothesis-testing point of view (because that's the stats world I live in the most) and I don't really see a hypothesis anywhere.... so I'd wonder just what hypothesis is being tested anyway.
(Oh, and if you had just one person doing the exp't over and over, you'd need to worry about carryover when setting up the pairs.... e.g. is the result of the 2nd pair influenced by the first, the third pair influenced by the first and second, etc.)