Here is a link to the program. Remote participation is possible (link on the website), and in-person participation is free, but we ask people to register so we can print badges and order the appropriate number of coffee breaks.

This workshop is being organized in partnership with EDGE, an Italian NGO that works on LGBT rights, and it is the first event of their initiative “A+I: Algoritmi + Inclusivi”, which will feature an awareness campaign and a series of video interviews that will start after the summer.

In next week’s workshop, Oreste Pollicino from Bocconi will talk about the perspective of the legal community around algorithmic discrimination, Symeon Papadopoulos from ITI Patras will give a survey on issues of fairness in image processing and image understanding, Sanghamitra Dutta from J.P. Morgan AI will talk about how to use the theory of causality to reason about fairness, Debora Nozza and Dirk Hovy from Bocconi will talk about issues of fairness in language models and natural language processing, and Omer Reingold from Stanford and Cynthia Dwork from Harvard will talk about modeling and achieving fairness in prediction models.

The last morning session will be a panel discussion moderated by Damiano Terziotti from EDGE about perspectives from the social sciences and from outside academia. It will feature, among others, Brando Benifei, a member of the European Parliament who has played a leading role in the 2021 draft EU regulations on AI. The other panel members are Alessandro Bonaita, who is a data science lead at Generali (Italy’s largest insurance company), Luisella Giani, who leads a technology consulting branch of Oracle for Europe, the Middle East and Africa, Cinzia Maiolini, who is in the national secretariat of CGIL, an Italian trade union, and Massimo Airoldi from the University of Milan.

If you are in or near Milan next week, come to what is shaping up to be a memorable event!

]]>- where: Bocconi University’s Roentgen Building (via Roentgen 1, Milano), Room AS01
- when: June 15-18
- what: talks on cryptography and graph algorithms, including two hours devoted to Max Flow in nearly-linear time
- how: register for free

Italy has a wonderfully named, and well-known within the country, National Academy of Arts and Science, the Accademia dei Lincei, which means something like academy of the “eagle-eyed” (literally, lynx-eyed), that is, people that can see far. The Accademia dei XL is much less well known, although it has a distinguished 240-year history, during which people like Guglielmo Marconi and Enrico Fermi were members. More recently, the much beloved Rita Levi-Montalcini, Holocaust survivor, Nobel Laureate, and Senator-for-life, was a member. Current members include Nobel Laureates Carlo Rubbia and Giorgio Parisi. Noted algebraist Corrado De Concini is the current president.

Be that as it may, the academicians did vote to make me a member, their first computer scientist ever. Next week, at the inauguration of their 240th academic year, I will speak to the other members about randomness and pseudorandomness in computation.

]]>If you would like to come to Italy a few days in advance, Alon Rosen and I are organizing two co-located workshops on graph algorithms and on cryptography in Milan on June 15-18 (details forthcoming). If you want to stay longer, I am organizing a mini-workshop on fairness in AI in Milan on June 27 (more details about it in a few days). Registration will be free for both events. There are several high-speed trains every day between Rome and Milan, taking about 3 hours.

**Call for Participation**

**54th ACM Symposium on Theory of Computing (STOC 2022) – Theory Fest**

**June 20-24, 2022**

**Rome, Italy**

The 54th ACM Symposium on Theory of Computing (STOC 2022) is sponsored by the ACM Special Interest Group on Algorithms and Computation Theory and will be held in Rome, Italy, Monday June 20 – Friday, June 24, 2022.

STOC 2022 – Theory Fest will feature technical talk sessions, 6 workshops with introductory tutorials, poster sessions, social events, and a special joint session with “Accademia Nazionale dei Lincei”, the oldest and most prestigious Italian academic institution, followed by a reception and a concert at the Academy historic site.

**Registration**

STOC 2022 registration is available here.

**Early registration deadline: April 30th.**

STOC 2022 is sponsored by Algorand, Amazon, Apple, Google, IOHK, Microsoft, Sapienza University of Rome.

]]>The new Sapienza computer science department was founded mostly by faculty from the Sapienza mathematics department, plus a number of people that came from other places to help start it. Among the latter, Renato Capocelli had moved to Rome from the University of Salerno, where he had been department chair of computer science.

Capocelli worked on combinatorics and information theory. In the early 90s, he had also become interested in the then-new area of zero-knowledge proofs.

Capocelli taught the information-theory course that I was attending, and it was a very different experience from the classes I had attended up to that point. To get the new major started, several professors were teaching classes outside their area, sticking close to their notes. Those teaching mathematical classes were experts, but they did not deviate from the definition-theorem-proof script. Capocelli had an infectious passion for his subject, took his time to make us gain an intuitive understanding of the concepts of information theory, was full of examples and anecdotes, and always emphasized the high-level idea of the proofs.

I subsequently met several other charismatic and inspiring computer scientists and mathematicians, though Capocelli had a very different personality from most of them. He was like an earlier generation of Southern Italian intellectuals, who could be passionate about their subject in a peculiarly non-nerdy way, loving it the way one may love food, people, nature, or a full life in general.

On April 8, 1992, Renato Capocelli died suddenly and unexpectedly, though his memory lives on in the many people he inspired. The Computer Science department of the University of Salerno was named after him for a period of time.

]]>A few weeks ago, we were joined by Francesca Buffa and Marc Mezard.

Francesca, a computational biologist formerly at Oxford medical school, is now the fourth out of four computer science tenured faculty in our new department to have an active ERC grant.

Marc’s work has spanned theoretical physics, information theory and computation, including collaborations on Giorgio Parisi’s Nobel Prize-winning work, and he has been most recently the president of the École Normale Supérieure in Paris. When we asked for letters for his tenure case, one of the reviewers wrote, more or less in so many words, “you would be lucky to have Marc in your university, though it’s very unlikely that he will accept your offer”. At that point Marc had already accepted.

]]>Happy New Year!

]]>Some details are here. Candidates must apply online by January 15 (end of day Central Europe time) for the application to be considered. To apply online, go to https://jobmarket.unibocconi.eu/ and look at the only opening that has a Jan 15 expiration (currently it is at the top of the list). The negotiable start date is September, 2022. By that time the new Computing Sciences department will be fully operational.

We are interested in all areas of computer science. Alon Rosen, Dirk Hovy and I are very happy to talk to prospective candidates about what the university is like and what its plans are for developing computer science.

The university pays internationally competitive salaries and provides relocation assistance. The language of instruction of all computer science courses at both undergraduate and graduate levels is English.

Scholars of any nationality who have not lived in Italy for the past two years and who move to Italy to take a university tenure-track or tenured position pay almost no income tax for six years, or more if they buy a home in Italy and/or have children under the age of 18.

For Italians working in Italy: this position is governed by a private-law contract with Bocconi, which is not the same as an RTDA or RTDB position, although the terms are similar.

Subject to a successful mid-term review (which usually happens after three years), and a successful tenure review (which happens within five years from the mid-term review, or possibly earlier depending on the background of the candidate), assistant professors are promoted to associate professors with tenure. (For those familiar with the Italian system, the latter positions are fully recognized as professore associato by the ministry of university.)

]]>**1. Matrix Multiplicative Weights Update**

In this post we consider the following generalization, introduced and studied by Arora and Kale, of the “learning from expert advice” setting and the multiplicative weights update method. In the “experts” model, we have a repeated game in which, at each time step $t$, we have the option of following the advice of one of $n$ experts; if we follow the advice of expert $i$ at time $t$, we incur a loss of $\ell_t(i)$, which is unknown to us (although, at time $t$, we know the loss functions $\ell_1(\cdot),\ldots,\ell_{t-1}(\cdot)$). We are allowed to choose a probabilistic strategy, whereby we follow the advice of expert $i$ with probability $x_t(i)$, so that our expected loss at time $t$ is $\sum_{i=1}^n x_t(i) \ell_t(i)$.

In the matrix version, instead of choosing an expert $i$ we are allowed to choose a unit $n$-dimensional vector $v_t$, and the loss incurred in choosing the vector $v_t$ is $v_t^T L_t v_t$, where $L_t$ is an unknown symmetric $n \times n$ matrix. We are also allowed to choose a probabilistic strategy, so that with probability $x_t(j)$ we choose the unit vector $v_t^{(j)}$, and we incur the expected loss

$$ \sum_j x_t(j) \cdot (v_t^{(j)})^T L_t v_t^{(j)} $$

The above expression can also be written as

$$ X_t \bullet L_t $$

where $X_t = \sum_j x_t(j) \, v_t^{(j)} (v_t^{(j)})^T$ and we used the Frobenius inner product among square matrices defined as $A \bullet B = \sum_{i,j} A_{i,j} B_{i,j}$. The matrices that can be obtained as convex combinations of rank-1 matrices of the form $v v^T$ where $v$ is a unit vector are called *density matrices* and can be characterized as the set of positive semidefinite matrices whose trace is 1.
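As a quick numerical sanity check of the identity between the expected loss of a mixed strategy and the Frobenius inner product with the corresponding density matrix, here is a minimal NumPy sketch. The dimension, the number of unit vectors, and the random seed are arbitrary choices made for illustration, and the variable names are not from the original post.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 4, 3  # dimension and number of unit vectors in the mixture (arbitrary)

# A random symmetric loss matrix L and k random unit vectors v_j (rows of V).
B = rng.standard_normal((n, n))
L = (B + B.T) / 2
V = rng.standard_normal((k, n))
V /= np.linalg.norm(V, axis=1, keepdims=True)

# Mixture probabilities x(j), normalized to sum to 1.
x = rng.random(k)
x /= x.sum()

# Expected loss computed directly: sum_j x(j) * v_j^T L v_j.
direct = sum(x[j] * (V[j] @ L @ V[j]) for j in range(k))

# Density matrix X = sum_j x(j) v_j v_j^T and its Frobenius inner product with L.
X = sum(x[j] * np.outer(V[j], V[j]) for j in range(k))
frobenius = np.sum(X * L)

print(np.isclose(direct, frobenius))             # the two expressions agree
print(np.isclose(np.trace(X), 1.0))              # X has trace 1
print(np.linalg.eigvalsh(X).min() >= -1e-12)     # X is positive semidefinite
```

The last two checks illustrate the characterization of density matrices as positive semidefinite matrices of trace 1, up to floating-point error.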

It is possible to see the above game as the “quantum version” of the experts setting. A choice of a unit vector is a *pure quantum state*; a probability distribution over pure quantum states, described by a density matrix, is a *mixed quantum state*. If $X$ is a density matrix describing a mixed quantum state, $L$ is a symmetric matrix, and $L = \sum_i \lambda_i w_i w_i^T$ is the spectral decomposition of $L$ in terms of its eigenvalues $\lambda_i$ and orthonormal eigenvectors $w_i$, then $X \bullet L$ is the expected outcome of a measurement of $X$ in the basis $\{ w_1,\ldots,w_n \}$, where $\lambda_i$ is the value of the measurement if the outcome is $w_i$.

If you have no idea what the above paragraph means, that is perfectly ok because this view will not be particularly helpful in motivating the algorithm and analysis that we will describe. (Here I am reminded of the joke about the way people from Naples give directions: “How do I get to the post office?”, “Well, you see that road over there? After a couple of blocks there is a pharmacy, where my uncle used to work, though now he is retired.” “Ok?” “Now, if you turn left after the pharmacy, after a while you get to a square with a big fountain and the church of St. Anthony where my niece got married. It was a beautiful ceremony, but the food at the reception was not great.” “Yes, I know that square”, “Good, don’t go there, the post office is not that way. Now, if you instead take that other road over there …”)

The main point of the above game, and of the Matrix Multiplicative Weights Update (MMWU) algorithm that plays it with bounded regret, is that it provides useful generalizations of the standard “experts” game and of the Multiplicative Weights Update (MWU) algorithm. For example, as we have already seen, MWU can provide a “derandomization” of the Chernoff bound; we will see that MMWU provides a derandomization of the *matrix* Chernoff bound. MWU can be used to approximate certain Linear Programming problems; MMWU can be used to approximate certain *Semidefinite Programming* problems.

To define and analyze the MMWU algorithm, we need to introduce certain operations on matrices. We will always work with real-valued symmetric matrices, but everything generalizes to complex-valued Hermitian matrices. If $M$ is a symmetric matrix, $\lambda_1,\ldots,\lambda_n$ are the eigenvalues of $M$, and $v_1,\ldots,v_n$ are corresponding orthonormal eigenvectors, then we will define a number of operations and functions on $M$ that operate on the eigenvalues while leaving the eigenvectors unchanged.

The first operation is *matrix exponentiation*: we define

$$ e^M := \sum_i e^{\lambda_i} v_i v_i^T $$

The operation always defines a positive definite matrix, and the resulting matrix satisfies a “Taylor expansion”

$$ e^M = \sum_{k=0}^\infty \frac 1{k!} M^k = I + M + \frac 12 M^2 + \cdots $$

Indeed, it is more common to use the above expansion as the definition of the matrix exponential, and then derive the expression in terms of eigenvalues.

We also have the useful bounds

$$ e^M \succeq I + M $$

which is true for every symmetric $M$ and

$$ e^M \preceq I + M + M^2 $$

which is true for all $M$ such that $M \preceq I$.

Analogously, if $M$ is positive definite, we can define

$$ \log M := \sum_i (\log \lambda_i) \, v_i v_i^T $$

and we have a number of identities like $e^{\log M} = M$, $\log M^k = k \log M$, $e^{kM} = (e^M)^k$, where $k$ is a scalar. We should be careful, however, not to take the analogy with real numbers too far: for example, if $A$ and $B$ are two symmetric matrices, in general it is not true that $e^{A+B} = e^A e^B$; in fact, the identity always fails unless $A$ and $B$ commute, in which case it is trivially true. We have, however, the following extremely useful fact.
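The eigenvalue-based definitions of the matrix exponential and logarithm translate almost verbatim into code. The following is a minimal NumPy sketch, not a production implementation: the helper names `apply_to_eigenvalues`, `sym_exp` and `sym_log` are made up for this illustration, and the test matrix is random. It compares the eigenvalue definition with a truncated Taylor series and checks the two bounds on a matrix with spectral norm 1.

```python
import math
import numpy as np

def apply_to_eigenvalues(M, f):
    """Apply the scalar function f to the eigenvalues of symmetric M, keeping the eigenvectors."""
    lam, V = np.linalg.eigh(M)          # columns of V are orthonormal eigenvectors
    return (V * f(lam)) @ V.T           # sum_i f(lam_i) v_i v_i^T

def sym_exp(M):
    return apply_to_eigenvalues(M, np.exp)

def sym_log(M):
    return apply_to_eigenvalues(M, np.log)   # requires M positive definite

rng = np.random.default_rng(1)
B = rng.standard_normal((4, 4))
M = (B + B.T) / 2
M /= np.linalg.norm(M, 2)               # rescale to spectral norm 1, so that M <= I

E = sym_exp(M)
I = np.eye(4)

# Truncated Taylor series: sum over k < 20 of M^k / k!.
taylor = sum(np.linalg.matrix_power(M, k) / math.factorial(k) for k in range(20))

# A >= B in the positive semidefinite order iff the smallest eigenvalue of A - B is >= 0.
lower_gap = np.linalg.eigvalsh(E - (I + M)).min()            # e^M >= I + M
upper_gap = np.linalg.eigvalsh(I + M + M @ M - E).min()      # e^M <= I + M + M^2

print(np.allclose(taylor, E))
print(lower_gap >= -1e-10, upper_gap >= -1e-10)
print(np.allclose(sym_log(E), M))       # log(e^M) = M
```

Going through `numpy.linalg.eigh` is a natural choice here because all the matrices in the post are symmetric; for general matrices one would need a different routine.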

Theorem 1 (Golden-Thompson Inequality) If $A$ and $B$ are two symmetric matrices, then

$$ {\rm tr}(e^{A+B}) \le {\rm tr}(e^A e^B) $$

The Golden-Thompson inequality will be all we need to generalize to this matrix setting everything we have proved about multiplicative weights. See this post by Terry Tao for a proof.
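To get a feeling for what the inequality says, one can test it numerically on random symmetric matrices. The sketch below assumes NumPy; the helper names and the random instances are illustrative choices, and passing a numerical check on one instance is of course not a proof. It also shows that $e^{A+B}$ and $e^A e^B$ are different matrices when $A$ and $B$ do not commute, even though their traces are ordered.

```python
import numpy as np

def sym_exp(M):
    """Matrix exponential of a symmetric matrix via its eigendecomposition."""
    lam, V = np.linalg.eigh(M)
    return (V * np.exp(lam)) @ V.T

rng = np.random.default_rng(2)

def random_symmetric(n):
    B = rng.standard_normal((n, n))
    return (B + B.T) / 2

A, B = random_symmetric(4), random_symmetric(4)

# Two random symmetric matrices do not commute, and then e^{A+B} != e^A e^B ...
commute = np.allclose(A @ B, B @ A)
same_matrix = np.allclose(sym_exp(A + B), sym_exp(A) @ sym_exp(B))

# ... but Golden-Thompson still orders the traces: tr e^{A+B} <= tr(e^A e^B).
lhs = np.trace(sym_exp(A + B))
rhs = np.trace(sym_exp(A) @ sym_exp(B))

print(commute, same_matrix)
print(lhs <= rhs)
```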

The *Von Neumann entropy* of a density matrix $X$ with eigenvalues $\lambda_1,\ldots,\lambda_n$ is defined as

$$ S(X) := \sum_i \lambda_i \log \frac 1{\lambda_i} = - X \bullet \log X $$

that is, if we view $X = \sum_i \lambda_i v_i v_i^T$ as the mixed quantum state in which the pure state $v_i$ has probability $\lambda_i$, then $S(X)$ is the entropy of the distribution over the pure states. Again, this is not a particularly helpful point of view, and in fact we will be interested in defining $S(\cdot)$ not just for density matrices but for arbitrary positive definite matrices, and even positive semidefinite ones (with the convention that $0 \log 0 = 0$, which is used also in the standard definition of entropy of a distribution).

We will be interested in using Von Neumann entropy as a regularizer, and hence we will want to know what its Bregman divergence is. Some calculations show that the Bregman divergence of the negative Von Neumann entropy $-S$, which is called the quantum relative entropy, is

$$ S(X_1 \| X_2) := X_1 \bullet (\log X_1 - \log X_2) + {\rm tr}(X_2) - {\rm tr}(X_1) $$

If $X_1$ and $X_2$ are density matrices, the terms ${\rm tr}(X_2) - {\rm tr}(X_1)$ cancel out; the above definition is valid for arbitrary positive definite matrices.
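Both quantities are easy to compute from an eigendecomposition. Here is a minimal NumPy sketch under the definitions above; the function names, the tolerance used to implement the $0 \log 0 = 0$ convention, and the way the random density matrices are generated are all assumptions made for illustration.

```python
import numpy as np

def sym_log(M):
    """Matrix logarithm of a symmetric positive definite matrix."""
    lam, V = np.linalg.eigh(M)
    return (V * np.log(lam)) @ V.T

def von_neumann_entropy(X):
    """Sum of lam_i * log(1 / lam_i) over the eigenvalues, with 0 log 0 = 0."""
    lam = np.linalg.eigvalsh(X)
    lam = lam[lam > 1e-12]               # drop numerically zero eigenvalues
    return float(-np.sum(lam * np.log(lam)))

def relative_entropy(X1, X2):
    """X1 . (log X1 - log X2) + tr(X2) - tr(X1), for positive definite X1 and X2."""
    inner = np.sum(X1 * (sym_log(X1) - sym_log(X2)))
    return float(inner + np.trace(X2) - np.trace(X1))

def random_density(n, rng):
    B = rng.standard_normal((n, n))
    P = B @ B.T + 1e-3 * np.eye(n)       # positive definite
    return P / np.trace(P)               # normalize to trace 1

rng = np.random.default_rng(3)
n = 5
X, Y = random_density(n, rng), random_density(n, rng)
uniform = np.eye(n) / n

print(von_neumann_entropy(uniform), np.log(n))     # the maximum entropy is log n
print(relative_entropy(X, uniform), np.log(n) - von_neumann_entropy(X))
print(relative_entropy(X, Y) >= 0, abs(relative_entropy(X, X)) < 1e-9)
```

The second line of output illustrates that the relative entropy from the scaled identity equals $\log n$ minus the entropy, and the third that, like any Bregman divergence of a convex function, the relative entropy is nonnegative and vanishes when the two arguments coincide.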

We will have to study the minima of various functions that take a matrix as an input, so it is good to understand how to compute the gradient of such functions. For example, what is the gradient of the function ${\rm tr}(X)$? Working through the definition we see that $\nabla {\rm tr}(X) = I$, and indeed we always have that the gradient of the function $X \mapsto A \bullet X$ is $A$ everywhere. Somewhat less obvious is the calculation of the gradient of the Von Neumann entropy, which is

$$ \nabla S(X) = - (I + \log X) $$

**2. Analysis in the Constrained FTRL Framework**

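Suppose that we play the game that we described above using agile mirror descent and using negative Von Neumann entropy (appropriately scaled) as a regularizer. That is, for some $c > 0$ that we will choose later, we use the regularizer

$$ R(X) := - c \cdot S(X) = c \cdot X \bullet \log X $$

which has the Bregman divergence

$$ D(X_1, X_2) = c \cdot S(X_1 \| X_2) = c \cdot \left( X_1 \bullet (\log X_1 - \log X_2) + {\rm tr}(X_2) - {\rm tr}(X_1) \right) $$

and our feasible set is the set of density matrices

$$ \Delta := \{ X \in {\mathbb R}^{n \times n} : X \succeq 0 , \ {\rm tr}(X) = 1 \} $$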

To bound the regret, we just have to plug the above definitions into the machinery that we developed in our fifth post.

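At time 1, we play the identity matrix scaled by $1/n$, which is a density matrix of maximum Von Neumann entropy $\log n$:

$$ X_1 := \frac 1n I $$

At time $t+1$, we play the matrix $X_{t+1}$ obtained as

$$ \hat X_{t+1} := \arg\min_{X \succ 0} \ D(X, X_t) + X \bullet L_t $$

$$ X_{t+1} := \arg\min_{X \in \Delta} \ D(X, \hat X_{t+1}) $$

and recall that we proved that, after $T$ steps, for every density matrix $X$,

$$ {\rm Regret}_T(X) \le D(X, X_1) + \sum_{t=1}^T D(X_t, \hat X_{t+1}) $$

If $X$ is a density matrix with eigenvalues $\lambda_1,\ldots,\lambda_n$, then the first term is

$$ D\left(X, \frac 1n I\right) = c \cdot X \bullet \left( \log X + (\log n) I \right) = c \log n - c \sum_i \lambda_i \log \frac 1{\lambda_i} \le c \log n $$

To complete the analysis we have to understand $D(X_t, \hat X_{t+1})$, and for this we need an expression for $\hat X_{t+1}$. We need to compute the gradient of $D(X, X_t) + X \bullet L_t$ and set it to zero. The gradient of $X \bullet L_t$ is just $L_t$. The gradient of $D(X, X_t)$ is

$$ \nabla D(X, X_t) = c \cdot (\log X - \log X_t) $$

Meaning that we want to solve for

$$ c \cdot (\log X - \log X_t) + L_t = 0 $$

and $\hat X_{t+1}$ satisfies

$$ \log \hat X_{t+1} = \log X_t - \frac 1c L_t $$

and we can write

$$ D(X_t, \hat X_{t+1}) = c \cdot X_t \bullet (\log X_t - \log \hat X_{t+1}) + c \cdot {\rm tr}(\hat X_{t+1}) - c \cdot {\rm tr}(X_t) = X_t \bullet L_t + c \cdot {\rm tr}(\hat X_{t+1}) - c $$

Then we can use Golden-Thompson and the fact that $e^{-\frac 1c L_t} \preceq I - \frac 1c L_t + \frac 1{c^2} L_t^2$, which holds if $\| L_t \| \le c$ (where $\| \cdot \|$ is the spectral norm), to write

$$ {\rm tr}(\hat X_{t+1}) = {\rm tr}\left( e^{\log X_t - \frac 1c L_t} \right) \le {\rm tr}\left( X_t e^{-\frac 1c L_t} \right) \le X_t \bullet \left( I - \frac 1c L_t + \frac 1{c^2} L_t^2 \right) = 1 - \frac 1c X_t \bullet L_t + \frac 1{c^2} X_t \bullet L_t^2 $$

Combining everything together we have

$$ D(X_t, \hat X_{t+1}) \le \frac 1c X_t \bullet L_t^2 $$

and so, provided $\| L_t \| \le c$ for every $t$,

$$ {\rm Regret}_T \le c \log n + \frac 1c \sum_{t=1}^T X_t \bullet L_t^2 $$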

This is the best bound we can hope for, and it matches Theorem 1 in our first post about the Multiplicative Weights Update algorithm.

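If we have $\| L_t \| \le 1$ for every $t$, we can simplify it to

$$ {\rm Regret}_T \le c \log n + \frac Tc \le 2 \sqrt{T \log n} $$

where the last step comes from optimizing $c$.

We can also write, under the condition $\| L_t \| \le c$, and using $L_t^2 \preceq \| L_t \| \cdot |L_t|$,

$$ {\rm Regret}_T \le c \log n + \frac 1c \sum_{t=1}^T \| L_t \| \cdot ( X_t \bullet |L_t| ) $$

where $|L_t|$ is the “absolute value” of the matrix $L_t$ defined in the following way: if $M = \sum_i \lambda_i v_i v_i^T$ is a symmetric matrix, then its absolute value is $|M| := \sum_i |\lambda_i| v_i v_i^T$. Allen-Zhu, Liao and Orecchia state the analysis in this way in their paper on generalizations of Matrix Multiplicative Weights.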
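The whole algorithm fits in a few lines. The sketch below is one way to simulate it in NumPy, under two assumptions that are worth stating: it uses the closed form in which the matrix played at each step is the exponential of the negated, scaled sum of past losses, normalized to trace 1 (this follows from the update for the logarithm derived above together with the standard fact that the Bregman projection onto density matrices under quantum relative entropy is division by the trace), and it plays against random losses rather than an adversary. The function name `mmwu` and all parameters are illustrative.

```python
import numpy as np

def sym_exp(M):
    """Matrix exponential of a symmetric matrix via its eigendecomposition."""
    lam, V = np.linalg.eigh(M)
    return (V * np.exp(lam)) @ V.T

def mmwu(losses, c):
    """Density matrices X_1, ..., X_T played against symmetric losses L_1, ..., L_T."""
    n = losses[0].shape[0]
    cumulative = np.zeros((n, n))
    plays = []
    for L in losses:
        # X_t is exp(-(1/c) * sum_{s<t} L_s) normalized to trace 1; in particular X_1 = I/n.
        E = sym_exp(-cumulative / c)
        plays.append(E / np.trace(E))
        cumulative += L
    return plays

rng = np.random.default_rng(4)
n, T = 6, 200
losses = []
for _ in range(T):
    B = rng.standard_normal((n, n))
    L = (B + B.T) / 2
    losses.append(L / np.linalg.norm(L, 2))     # spectral norm exactly 1

c = np.sqrt(T / np.log(n))                       # the optimized choice; here c >= 1
plays = mmwu(losses, c)

alg_loss = sum(np.sum(X * L) for X, L in zip(plays, losses))
best_loss = np.linalg.eigvalsh(sum(losses)).min()   # loss of the best fixed density matrix
regret = alg_loss - best_loss
bound = c * np.log(n) + sum(np.sum(X * (L @ L)) for X, L in zip(plays, losses)) / c

print(regret, bound, 2 * np.sqrt(T * np.log(n)))
```

On any input satisfying the norm condition, the printed regret should be at most the second number, which in turn is at most the third. Recomputing the exponential from the cumulative sum at each step, instead of updating the logarithm incrementally, avoids taking logarithms of nearly singular matrices.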

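Our next post will discuss applications at length, but for now let us gain a bit of intuition about the usefulness of these regret bounds. Recall that, for every symmetric matrix $M$, we have

$$ \lambda_{\min}(M) = \min_{X \in \Delta} \ M \bullet X $$

and so the regret bound can be reinterpreted in the following way: if we let $L_1,\ldots,L_T$ be the loss functions used in a game played against an MMWU algorithm, and the algorithm selects density matrices $X_1,\ldots,X_T$, then

$$ \sum_{t=1}^T X_t \bullet L_t - \lambda_{\min}\left( \sum_{t=1}^T L_t \right) \le c \log n + \frac 1c \sum_{t=1}^T X_t \bullet L_t^2 $$

that is,

$$ \lambda_{\min}\left( \sum_{t=1}^T L_t \right) \ge \sum_{t=1}^T X_t \bullet L_t - c \log n - \frac 1c \sum_{t=1}^T X_t \bullet L_t^2 $$

provided that $\| L_t \| \le c$. For example, switching $L_t$ with $-L_t$ (so that the $X_t$ are now the density matrices that MMWU plays against the losses $-L_t$), we have

$$ \lambda_{\max}\left( \sum_{t=1}^T L_t \right) \le \sum_{t=1}^T X_t \bullet L_t + c \log n + \frac 1c \sum_{t=1}^T X_t \bullet L_t^2 $$

provided that $\| L_t \| \le c$, which means that if we can choose a sequence of loss matrices that make the MMWU have small loss at each step, then we are guaranteed that the sum of such matrices cannot have any large eigenvalue.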
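The bound on the largest eigenvalue can be checked in the same way. In the following NumPy sketch the density matrices are those that MMWU would play against the negated losses, which by the closed form for the algorithm means exponentiating the positively scaled running sum of the matrices; the random choice of the matrices, the value of $c$, and all names are assumptions made for illustration.

```python
import numpy as np

def sym_exp(M):
    """Matrix exponential of a symmetric matrix via its eigendecomposition."""
    lam, V = np.linalg.eigh(M)
    return (V * np.exp(lam)) @ V.T

rng = np.random.default_rng(5)
n, T, c = 5, 100, 4.0

total = np.zeros((n, n))      # running sum of the L_t
first_order = 0.0             # sum_t X_t . L_t
second_order = 0.0            # sum_t X_t . L_t^2
for _ in range(T):
    # Density matrix played by MMWU when it is run on the losses -L_1, ..., -L_{t-1}.
    E = sym_exp(total / c)
    X = E / np.trace(E)
    # Stand-in for the choice of the next matrix: random symmetric, spectral norm 1 <= c.
    B = rng.standard_normal((n, n))
    L = (B + B.T) / 2
    L /= np.linalg.norm(L, 2)
    first_order += np.sum(X * L)
    second_order += np.sum(X * (L @ L))
    total += L

lam_max = np.linalg.eigvalsh(total).max()
upper = first_order + c * np.log(n) + second_order / c
print(lam_max, upper)
```

In an application one would replace the random choice of each matrix with a choice that keeps its inner product with the current density matrix small, which is what makes the resulting bound on the largest eigenvalue useful.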

]]>