Two sample t-test to show equality of the two means
Given two (numeric) samples I would like to show that there is not a significant difference between the two means $\mu_{1}$ and $\mu_{2}$.
If my goal was to show a significant difference I would formulate the $t$-test as follows:
(1) $H_{0}: \mu_{1} = \mu_{2}$ vs $H_{1}: \mu_{1} \neq \mu_{2}$
I learned in school that the null hypothesis should always represent the "common" belief and the alternative hypothesis should represent the change that I would like to show.
So then if my goal is to show that there is not a significant difference between both means, should I formulate the test like this?
(2) $H_{0}: \mu_{1} \neq \mu_{2}$ vs $H_{1}: \mu_{1} = \mu_{2}$
Or can I use the first test (1) and when I am not able to reject the null hypothesis say that there is not a significant difference?
hypothesis-testing t-test equivalence
asked 2 days ago by cmplx96; edited yesterday by Alexis
The alternative hypothesis indicates what an extreme result might look like. The problem with your (2) formulation is that an extreme result would then be a difference in means close to $0$; so if you took a commonly used significance level of $5\%$ then the power of the test (its ability to reject the null hypothesis when it is false) would never be above $5\%$ no matter how large the sample size. This is not good.
– Henry
2 days ago
"I learned in school that the null hypothesis should always represent the "common" belief and the alternative hypothesis should represent the change that I would like to show." Then you were taught to commit to confirmation bias as a mode of scientific inquiry.
– Alexis
yesterday
2 Answers
You cannot use the first test in the way you describe, because failure to reject in the first test only says that you were unable to reject $H_0$, nothing more than that. It is like only being given the information that "the prosecutor was unable to provide the jury with enough evidence to secure a conviction": that does not tell you that the suspect is innocent.
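To illustrate why a non-significant result is weak evidence of equality, here is a small simulation sketch. Everything in it is an illustrative assumption rather than something from the question: normally distributed data with unit variance, a true difference of 0.5, and Welch's test via `scipy.stats.ttest_ind` with `equal_var=False`.

```python
import numpy as np
from scipy import stats


def non_rejection_rate(true_diff, n, alpha=0.05, n_sim=2000, seed=0):
    """Share of simulated two-sided Welch t-tests that fail to reject H0: mu1 == mu2."""
    rng = np.random.default_rng(seed)
    # Each row is one simulated experiment with n observations per group.
    x = rng.normal(loc=0.0, scale=1.0, size=(n_sim, n))
    y = rng.normal(loc=true_diff, scale=1.0, size=(n_sim, n))
    p_values = stats.ttest_ind(x, y, axis=1, equal_var=False).pvalue
    return float(np.mean(p_values >= alpha))


# The true means differ by half a standard deviation in both settings.
rate_small_n = non_rejection_rate(true_diff=0.5, n=10)
rate_large_n = non_rejection_rate(true_diff=0.5, n=200)
print(f"n = 10 per group:  fail to reject in {rate_small_n:.0%} of runs")
print(f"n = 200 per group: fail to reject in {rate_large_n:.0%} of runs")
```

With only 10 observations per group the test usually fails to reject even though the means really do differ, so "not rejected" mostly reflects the small sample, not equal means.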
The second test is not usable in practice, because no matter how much data you have, you cannot exclude the possibility of very small differences.
What you can do is to look at
$$H_{0}: |\mu_{1} - \mu_{2}| > \delta \text{ vs } H_{1}: |\mu_{1} - \mu_{2}| \leq \delta,$$
i.e. try to reject the null hypothesis that the absolute size of the difference is greater than some difference $\delta>0$. $\delta$ would be chosen e.g. so that any difference smaller than that is for all (or your specific) practical purposes irrelevant.
answered 2 days ago by Björn
Thanks! How would I then go about computing the test statistic? t = (x1 - x2 - delta) / sqrt(s1^2/n1 + s2^2/n2)?
– cmplx96
2 days ago
Not quite, you look in both directions, i.e. do two one-sided tests (en.wikipedia.org/wiki/Equivalence_test). Formulae for these are given e.g. here ncss.wpengine.netdna-cdn.com/wp-content/themes/ncss/pdf/….
– Björn
2 days ago
Note that we have a considerably informative tag on two one-sided tests here. That's what the [tost] tag is for. :)
– Alexis
yesterday
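The two one-sided tests (TOST) mentioned in the comments can be sketched roughly as follows. This is a minimal sketch, not the formulae from the linked PDF: the function name `tost_welch`, the use of Welch's unequal-variance standard error with Satterthwaite degrees of freedom, and the simulated data are all assumptions made for illustration.

```python
import numpy as np
from scipy import stats


def tost_welch(x, y, delta, alpha=0.05):
    """Two one-sided Welch t-tests for H0: |mu1 - mu2| >= delta (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n1, n2 = x.size, y.size
    v1, v2 = x.var(ddof=1), y.var(ddof=1)
    diff = x.mean() - y.mean()
    se = np.sqrt(v1 / n1 + v2 / n2)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (v1 / n1 + v2 / n2) ** 2 / (
        (v1 / n1) ** 2 / (n1 - 1) + (v2 / n2) ** 2 / (n2 - 1)
    )
    # Test 1: H0: mu1 - mu2 <= -delta  vs  H1: mu1 - mu2 > -delta
    p_lower = stats.t.sf((diff + delta) / se, df)
    # Test 2: H0: mu1 - mu2 >= +delta  vs  H1: mu1 - mu2 < +delta
    p_upper = stats.t.cdf((diff - delta) / se, df)
    # Equivalence is claimed only if BOTH one-sided nulls are rejected.
    p_value = float(max(p_lower, p_upper))
    return p_value, bool(p_value < alpha)


rng = np.random.default_rng(1)
a = rng.normal(10.0, 1.0, size=500)
b_same = rng.normal(10.0, 1.0, size=500)   # same true mean as a
b_shift = rng.normal(11.0, 1.0, size=500)  # true mean shifted by 1.0

p_same, equivalent_same = tost_welch(a, b_same, delta=0.5)
p_shift, equivalent_shift = tost_welch(a, b_shift, delta=0.5)
print(p_same, equivalent_same)
print(p_shift, equivalent_shift)
```

Note the difference from the single statistic proposed in the comment: the margin is added in one test and subtracted in the other, and the larger of the two p-values is the one that counts.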
> I learned in school that the null hypothesis should always represent the "common" belief and the alternative hypothesis should represent the change that I would like to show.
That is not an accurate explanation of the null hypothesis. The null hypothesis is simply a hypothesis that consists of a specific distribution from which probabilities can be calculated. The reason we use $\mu_1=\mu_2$ as the null hypothesis has nothing to do with whether this is the "common" belief. It's used as the null hypothesis because if we hypothesize that the mean is a specific value, then given a particular set of data we can calculate the probability of seeing that data. We can't use $\mu_1 \neq \mu_2$ as our null hypothesis because there's no way to calculate p-values based simply on the hypothesis that the means aren't equal to a particular value. Consider the following problem:
> The weights of apples have a standard deviation of 5 grams. The mean is not equal to 100. What is the probability of seeing an apple with a weight of 110 grams?
There's no way to answer that, because simply being told what the mean isn't is not enough to calculate probabilities.
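A quick numerical sketch of the apple problem makes the point concrete. It assumes, purely for illustration, that weights are normally distributed, and it computes the probability of a weight of at least 110 grams (for a continuous distribution, an exact weight has probability zero). The candidate means are arbitrary values that all satisfy "not equal to 100".

```python
from scipy import stats

sd = 5.0  # standard deviation from the problem statement
candidate_means = (90.0, 99.0, 101.0, 110.0)  # all are "not equal to 100"

# P(weight >= 110) under each candidate mean, assuming normality.
tail_probs = {
    mean: float(stats.norm.sf(110.0, loc=mean, scale=sd))
    for mean in candidate_means
}
for mean, prob in tail_probs.items():
    print(f"mean = {mean:5.1f} g -> P(weight >= 110 g) = {prob:.4f}")
```

Each admissible mean gives a different probability, so "the mean is not 100" alone does not pin down an answer.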
Björn suggests testing the hypothesis that the difference in means is greater than some $\delta_0$. How that would work is to take the null hypothesis as that the difference is equal to exactly $\delta_0$. Then once you have the data, you can calculate the p-value given that $\delta_0$. Call that $p_{\delta_0}$. If the difference in sample means is less than $\delta_0$, then the p-value would have been even smaller than $p_{\delta_0}$ if we had chosen $\delta$ to be larger than $\delta_0$. We reject the null if the p-value is less than $\alpha$, so if we're rejecting under that null, that means that $p_{\delta_0} < \alpha$. And since $p_{\delta}<p_{\delta_0}$ for any $\delta>\delta_0$, we can conclude that $p_{\delta}<\alpha$ for any $\delta>\delta_0$. Thus, we can not only reject this null of $\delta_0$, but we can reject any null with a larger $\delta$. It is only because of this ability to get an upper bound on p that we don't need a specific value for $\delta$. If we just take "$\delta$ is larger than zero" as our null hypothesis, without any lower bound for $\delta$, then there is no upper bound for p, and so we cannot conclude that it is lower than $\alpha$.
answered yesterday by Acccumulation
You seem quite confused about how two one-sided tests for equivalence work: "How that would work is to take the null hypothesis as that the difference is equal to exactly $\delta_{0}$." is not remotely close to these procedures.
– Alexis
yesterday
@Alexis That is the rigorous mathematical theoretical foundation of the process. Certainly, there are people doing statistics in the field that are not engaging in full rigor.
– Acccumulation
yesterday
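The monotonicity claim in the answer above (a larger margin $\delta$ gives a smaller p-value for the same data) can be checked numerically. This is only a sketch under assumed summary statistics: the observed difference, its standard error, and the degrees of freedom below are made-up illustrative values, and the p-value is taken as the larger of the two one-sided p-values, as in a TOST procedure.

```python
import numpy as np
from scipy import stats

# Illustrative summary statistics (assumptions, not from the question).
diff = 0.2  # observed difference in sample means
se = 0.15   # standard error of that difference
df = 50     # degrees of freedom of the t reference distribution


def tost_p_value(delta):
    """Larger of the two one-sided p-values for the margin delta."""
    p_lower = stats.t.sf((diff + delta) / se, df)   # H0: true diff <= -delta
    p_upper = stats.t.cdf((diff - delta) / se, df)  # H0: true diff >= +delta
    return float(max(p_lower, p_upper))


deltas = np.array([0.3, 0.5, 0.7, 1.0])
p_values = np.array([tost_p_value(d) for d in deltas])
for d, p in zip(deltas, p_values):
    print(f"delta = {d:.1f} -> p = {p:.4g}")
```

For fixed data the p-value shrinks as the margin grows, which is why rejecting at the boundary value of the margin also rejects every larger margin.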