If you want to skip the backstory and jump right to the rules to follow to determine whether nodes in a DAG are independent, go to The Find.
⏪ Prelude
I recently had to do a homework on Bayesian Networks and determine whether pairs of nodes were conditionally independent to a set of nodes. Prior to this homework, in an introductory course, I had heard the terms d-separation and d-connection thrown around and knew they had something to do with independence. Perhaps because it was introductory course, I hadn’t bothered actually looking into what they meant back then. So, as I often do, I set about trying to find info - outside course notes - that would untangle the link between d-connection and blocked paths .🔍 The Search
After a bit of digging, I stumbled upon an online website that seemed to be, from its title, the ultimate won’t-waste-your-time and succinct resource: D-separation without tears.
Unfortunately, before I even had the chance to shed any tears, I realized that the resource wasn’t all that concise. So I continued my search, slogging, plodding and plowing until, at last, I found two sets of slides that had all I needed to write the meat of this post. Check out the References section to find the links to them.
📖 The Find
Here’s a condensed summary of the important information I dug up. It assumes that the reader knows what nodes, edges and conditional independence are.
Remark 1. For concision, paths are often represented as sequences of nodes, instead of sequences of edges.
Example 1. In the graph just above,
- is a collider in the path because it has two incoming edges and .
- is a collider in the path because it has two incoming edges and .
- is not a collider (it is a non-collider) in the path .
- and are paths between and but is not.
- there is a collider in the path such that neither the collider nor any of its descendants is in .
- there is a non-collider in the path that belongs to .
Example 3. Let . In Network Graph 1, the path is not blocked by because neither condition 2 can be met, since is empty, nor can condition 1 be met, since none of the nodes in the path is a collider.
Now that you’ve hopefully understood the first four definitions, you’re ready to apply the rule to determine whether nodes are conditionally independent.
Remark 2. The theorem implies that testing for conditional dependence, that is , only requires that one path be “not” blocked.
Remark 3. If there is no set of nodes conditioned on, then in the theorem.
The only piece of the puzzle left is to understand d-separation and d-connection. Here are their definitions.
🏋️ Exercises
The following exercises have to do with the network graph below. Please note that to make the notation lighter, I drop the curly braces surrounding sets.
False. There are no nodes nodes conditioned on. Therefore, any path between and must satisfy the first condition to be blocked. Consider the path . Since there are no colliders in , condition 1 cannot be satisfied, by default. Hence, since just this one path is blocked, then .
True. Again, it’s simply a matter of going through each path and verifying whether it’s blocked. The paths are:
- : The path is blocked because is a non-collider and it is conditioned on so condition 2 is satisfied.
- : The path is blocked because is a collider in the path that satisfies condition 1.
- : The path is blocked because is a collider in the path that satisfies condition 1.
- : The path is blocked because is a collider in the path that satisfies condition 1.
- : The path is blocked because is a collider in the path that satisfies condition 1.
True. Same reasoning as for Exercise 2.
True. Same reasoning as for Exercise 2.
False. The path is not blocked.
False. The path is not blocked.
True. All paths are blocked. The paths are:
- : Blocked because is a non-collider that is conditioned on so condition 2 is satisfied.
- : Blocked because is a collider that satisfies condition 1.
- : Ditto.
- : Blocked because is a non-collider that is conditioned on so condition 2 is satisfied.
- : Blocked because is a collider that satisfies condition 1.
False. The path is not blocked.
Invert the edge between B and F.