Bayes Nets & Gibbs Sampling Tutorial for CS6601 Assignment 3

Introduction: Why Bayes Nets Matter in Modern AI

Bayesian networks have become a cornerstone of artificial intelligence, enabling machines to reason under uncertainty. From self-driving cars predicting pedestrian movements to AI chatbots understanding ambiguous queries, Bayes nets help models make sense of incomplete data. In this tutorial, we'll build a Bayes net for a James Bond spy mission and use Gibbs sampling—a Markov Chain Monte Carlo (MCMC) method—to answer probability questions. This mirrors the core concepts in CS6601 Assignment 3: probabilistic modeling and sampling.

Understanding Bayesian Networks

A Bayesian network is a directed acyclic graph where nodes represent random variables and edges encode conditional dependencies. Each node has a conditional probability table (CPT) that quantifies the relationship with its parents. The network allows efficient computation of joint probabilities using the chain rule.

Key Components

Nodes: Variables like H (hire hackers), C (buy Contra), M (hire mercenaries), B (Bond guarding M), Q (hack Q's database), K (kidnap M), D (access Double-0 files).
Edges: Dependencies: H and C influence Q; M and B influence K; Q and K influence D.
CPTs: Probabilities from MI6 data: P(H)=0.5, P(C)=0.3, P(¬M)=0.2 so P(M)=0.8, P(B)=0.5, P(Q|H,C) given, P(K|M,B) given, P(D|Q,K) deterministic (AND gate).

Building the Bayes Net for the MI6 Mission

Let's design the network step by step. The variables are:

H: Spectre hires hackers (True/False)
C: Spectre buys Contra (True/False)
M: Spectre hires mercenaries (True/False)
B: Bond guards M (True/False)
Q: Q's database hacked (True/False)
K: M kidnapped (True/False)
D: Double-0 files accessed (True/False)

From the assignment: P(H)=0.5, P(C)=0.3, P(¬M)=0.2 → P(M)=0.8, P(B)=0.5. The CPTs: P(Q|H,C): if H=True, C=True → 0.9; H=True, C=False → 0.55; H=False, C=True → 0.25 (since 1-0.75); H=False, C=False → 0.05 (1-0.95). P(K|M,B): if M=True, B=True → 0.15 (1-0.85); M=True, B=False → 0.9 (given); M=False, B=True → 0.01 (assumed low); M=False, B=False → 0.05 (assumed). P(D|Q,K): D is true only if both Q and K are true (AND).

Gibbs Sampling: A Practical Approach

Gibbs sampling is an MCMC algorithm that generates samples from the joint distribution by iteratively sampling each variable conditioned on the current values of all others (its Markov blanket). For large networks, exact inference is intractable, but Gibbs sampling provides approximate answers.

How Gibbs Sampling Works

Initialize all variables randomly (e.g., all False).
For each variable in turn, compute its conditional probability given the current state of its Markov blanket.
Sample a new value for that variable from this distribution.
Repeat for many iterations. After a burn-in period, samples approximate the true posterior.

In our spy mission, we might want to know: given that the Double-0 files were accessed (D=True), what is the probability that Spectre used Contra (C=True)? This is a posterior query P(C|D=True).

Trend Connection: AI and Cybersecurity

Bayesian networks are increasingly used in cybersecurity to model attack paths and predict breaches—just like in this MI6 scenario. With the rise of AI-powered cyberattacks (e.g., deepfake phishing), probabilistic models help defenders anticipate threats. Gibbs sampling enables real-time threat assessment when exact calculations are too slow.

Step-by-Step Example: Querying the Network

Let's walk through a Gibbs sampling run for P(C|D=True). We'll simulate a few iterations manually.

Initialization

Set all variables to False: H=False, C=False, M=False, B=False, Q=False, K=False, D=False.

Iteration 1

Sample H given its Markov blanket (parents: none; children: Q). Current Q=False. P(H=True|Q=False) = P(H=True) * P(Q=False|H=True,C=False) / P(Q=False) = 0.5 * 0.45 / (0.5*0.45 + 0.5*0.95) ≈ 0.321. Suppose random number 0.6 > 0.321 → H=False.
Sample C similarly. P(C=True|Q=False) = 0.3 * 0.75 / (0.3*0.75 + 0.7*0.95) ≈ 0.253. Random 0.4 → C=False.
Sample M: P(M=True|K=False) = 0.8 * P(K=False|M=True,B=False) / ... = 0.8 * 0.1 / (0.8*0.1 + 0.2*0.95) ≈ 0.296. Random 0.8 → M=False.
Sample B: P(B=True|K=False) similar, assume stays False.
Sample Q: P(Q=True|H=False,C=False) = 0.05. Random 0.9 → Q=False.
Sample K: P(K=True|M=False,B=False) = 0.05. Random 0.7 → K=False.
D is deterministic: D=False because Q=False.

Iteration 2

We want D=True, so we condition on D=True. This changes the sampling: we must reject samples where D=False. In practice, we use rejection sampling or condition by clamping D=True. For Gibbs, we clamp D=True and sample other variables from their conditional given D=True and the current values.

With D=True, Q and K must be True. So we set Q=True, K=True. Now sample H given Q=True: P(H=True|Q=True) = 0.5 * 0.55 / (0.5*0.55 + 0.5*0.25) ≈ 0.6875. Random 0.4 → H=True. Sample C given Q=True: P(C=True|Q=True) = 0.3 * 0.9 / (0.3*0.9 + 0.7*0.55) ≈ 0.412. Random 0.5 → C=False. Sample M given K=True: P(M=True|K=True) = 0.8 * 0.9 / (0.8*0.9 + 0.2*0.05) ≈ 0.986. Random 0.2 → M=True. Sample B given K=True: P(B=True|K=True) = 0.5 * 0.15 / (0.5*0.15 + 0.5*0.9) ≈ 0.143. Random 0.8 → B=False.

After many iterations, count how often C=True when D=True. This ratio approximates P(C=True|D=True).

Why Gibbs Sampling Beats Simple Sampling

Traditional Monte Carlo sampling fails when events are rare (e.g., D=True). Gibbs sampling efficiently explores the posterior by focusing on high-probability regions. In AI, this is crucial for training deep learning models with Bayesian layers or for real-time inference in robotics.

Practical Tips for CS6601 Assignment 3

Implement the Bayes net structure and CPTs exactly as given.
For Gibbs, compute conditional probabilities using the Markov blanket formula: P(X|MB(X)) ∝ P(X|parents) * Π P(child|parents).
Use a burn-in period (e.g., 1000 samples) to let the chain converge.
Collect samples after burn-in to estimate probabilities.
Test your code on simple queries (e.g., P(D=True) by sampling without evidence).

Conclusion

Bayesian networks and Gibbs sampling are powerful tools for reasoning under uncertainty—skills every AI engineer needs. By modeling a James Bond mission, you've seen how probability theory applies to real-world security problems. As AI continues to evolve, these techniques will only grow in importance, from self-driving cars to medical diagnosis. Master them now, and you'll be ready for the future of intelligent systems.