On Graphical Models for
Communications and Machine Learning:
Algorithms, Bounds, and Analog Implementation
PhD. Thesis, Justin Dauwels
Final draft PDF file, 4.859 kB.
Slides of the defense PDF file, 4.212kB.
Abstract
This
dissertation is about a specific problem and about general methods. The
specific problem is carrier-phase synchronization, which appears in the
context of digital communications. The general methods are
message-passing algorithms operating on graphical models, in
particular, factor graphs. We consider applications of such algorithms
in the context of statistical inference (as in communications, signal
processing, and machine learning), statistics, information theory, and
the theory of dynamical systems (such as analog electronic circuits).
The primary motivation for this work was (1) to analyze the degradation
of digital communications systems due to oscillator non-idealities; (2)
the development of synchronization algorithms that minimize this
performance degradation.

Clocks are
ubiquitous in digital communications systems; real-life clocks are
noisy, i.e., their signals are not perfectly periodic, which often
leads to a significant degradation of the performance of communications
systems. In the early days of communications, this source of
degradation was only of secondary concern. Communications systems used
to operate far from the ultimate performance bound, i.e., channel
capacity. The main concern was therefore to develop error-correcting
techniques that could close the gap between the performance of
practical communications systems and channel capacity.
With the recent advent of iterative decoding techniques, communications
systems nowadays most often operate close to the ultimate performance
limits; issues such as synchronization, which were earlier only of
secondary importance, have now become the mayor (remaining) bottlenecks
in the design of communications systems.
In this dissertation, we focus on carrier-phase synchronization, i.e.,
the alignment of the phase of the local oscillator in the receiver to
the phase of the incoming carrier. The questions we address are:
1. Which physical mechanisms are responsible for phase noise? How can phase noise be modeled?
2. How can carrier-phase estimation algorithms systematically be derived?
3. What are the
ultimate limits for communication over channels with phase noise?
In particular:
a. How much does the information rate of a communications channel decrease due to phase noise?
b. How well can the (noisy) carrier phase be estimated?
In contrast to earlier and parallel work, our aim is not the design and
optimization of fully operating communications systems. In this thesis,
various tools are developed that lead (or may lead) to an answer to the
above questions (and many other related questions).
We give a detailed analysis of phase noise in free-running clocks and
PLLs (Question 1). We propose a simple intuitive model for phase noise
in free-running oscillators. We describe two simple models for passband
communications channels. The models take phase offsets into account
between the received carrier and the local carrier in the receiver, but
disregard timing offsets. In the first model, the phase is constant, in
the second, the phase performs a random walk. We investigate under
which conditions the two models are valid. Most methods of this thesis
will be illustrated by means of both channel models.
Most methods we propose in this dissertation are based on graphical
models, more precisely, factor graphs. Factor graphs are used to
visualize the structure of the system at hand. They represent the
factorization of multivariate functions. Each edge in the graph
corresponds to a variable, each node corresponds to a factor. Factor
graphs can represent any function, in particular, probabilistic models,
error-correcting codes, block diagrams and other common models in
communications, signal processing and beyond.
We show how factor graphs can be used as a tool to develop practical
estimation and detection algorithms. Our techniques can be applied to
model-based signal processing (e.g., phase estimation) and machine
learning. In particular, we formulate several standard
signal-processing and machine-learning algorithms as message passing on
factor graphs, e.g., particle methods, gradient methods, decision-based
methods, and expectation maximization. In all those algorithms, local
rules are applied at the nodes in a factor graph. In other words, the
(global) estimation and detection problem is tackled by a
divide-and-conquer strategy: the global computation is carried out by
multiple (simple) local computations. The local message update rules
may be used as building blocks for novel estimation and detection
algorithms. By listing the possible update rules at each node in the
factor graph, one can systematically explore novel algorithms. We
demonstrate this idea by deriving phase estimation algorithms for the
constant-phase model and the random-walk phase model (Question 2). We
also show how the back-propagation algorithm for the training of
feed-forward neural networks follows by applying generic
message-passing rules on a suitable factor graph. We elaborate on the
computation of kernels in the light of message passing on factor graphs.
We demonstrate how
message-passing algorithms for inference (in particular,
PN-synchronization) can be implemented as dynamical systems, in
particular, as clock-free analog electronic circuits. Those systems
operate in continuous time, and do not require a digital clock;
therefore, they circumvent the problem of timing synchronization.
We present a numerical algorithm to compute the information rate of
continuous channels with memory (Question 3.a). The algorithm is an
extension of the methods proposed earlier for discrete channels with
memory. Also here, factor graphs and the summary-propagation algorithm
are key ingredients. We apply the method to the random-walk phase model.
A numerical algorithm is proposed for computing the capacity (or lower
bounds on capacity) of continuous memoryless channels (Question 3.a).
We present numerical results for the Gaussian channel with
average-power and/or peak-power constraints. We outline how the
algorithm can be extended to continuous channels with memory (e.g.,
channels with phase noise) by means of message-passing techniques.
We propose message-passing algorithms to compute Cramér-Rao-type
bounds. Cramér-Rao-type bounds are lower bounds on the minimum mean
square estimation error; the bounds may be used to asses the
performance of practical (message-passing) estimation algorithms, in
particular, our phase-estimation algorithms (Question 3.b). The
algorithms we propose for computing Cramér-Rao-type bounds open the
door to exciting applications of information geometry, such as (1)
natural-gradient-based algorithms; (2) the computation of Fisher
kernels.
Defense
Thursday 1st of December 2005
Time: 2pm
Place: ETZ E81
Slides of the
defense. PDF file, 4.212kB.
Examiners
Prof. Dr. Hans-Andrea Loeliger (ETH Zurich, Institute for Signal and Information Processing)
Prof. Dr. Shun-ichi Amari (RIKEN, Brain Science Institute, Wako-shi, Japan)
Prof. Dr. Marc Moeneclaey (Digital Communications Lab, Gent University, Gent, Belgium)
Dr. Jonathan Yedidia (Mitsubishi Electrics Research Lab, Cambridge, MA, US)