A mutual information equality
$begingroup$
$x$ and $y$ are two random variables and $I(u;v)$ is the mutual information between random variables $u$ and $v$. Does the following equality hold?
$$text{argmax}_a I(y;ax)=text{argmin}_a I(y;y-ax).$$
information-theory
$endgroup$
add a comment |
$begingroup$
$x$ and $y$ are two random variables and $I(u;v)$ is the mutual information between random variables $u$ and $v$. Does the following equality hold?
$$text{argmax}_a I(y;ax)=text{argmin}_a I(y;y-ax).$$
information-theory
$endgroup$
add a comment |
$begingroup$
$x$ and $y$ are two random variables and $I(u;v)$ is the mutual information between random variables $u$ and $v$. Does the following equality hold?
$$text{argmax}_a I(y;ax)=text{argmin}_a I(y;y-ax).$$
information-theory
$endgroup$
$x$ and $y$ are two random variables and $I(u;v)$ is the mutual information between random variables $u$ and $v$. Does the following equality hold?
$$text{argmax}_a I(y;ax)=text{argmin}_a I(y;y-ax).$$
information-theory
information-theory
asked Jan 8 at 15:57
HansHans
4,98021032
4,98021032
add a comment |
add a comment |
1 Answer
1
active
oldest
votes
$begingroup$
It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.
Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.
$endgroup$
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3066344%2fa-mutual-information-equality%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.
Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.
$endgroup$
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
add a comment |
$begingroup$
It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.
Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.
$endgroup$
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
add a comment |
$begingroup$
It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.
Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.
$endgroup$
It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.
Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.
edited Jan 8 at 17:09
answered Jan 8 at 16:25
P. QuintonP. Quinton
1,7261213
1,7261213
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
add a comment |
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45
add a comment |
Thanks for contributing an answer to Mathematics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3066344%2fa-mutual-information-equality%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown