Simultaneous eating algorithm

From Wikipedia, the free encyclopedia
(Redirected from Probabilistic-serial rule)

A simultaneous eating algorithm (SE)[1] is an algorithm for allocating divisible objects among agents with ordinal preferences. "Ordinal preferences" means that each agent can rank the items from best to worst, but cannot (or does not want to) specify a numeric value for each item. The SE allocation satisfies SD-efficiency - a weak ordinal variant of Pareto-efficiency (it means that the allocation is Pareto-efficient for at least one vector of additive utility functions consistent with the agents' item rankings).

SE is parametrized by the "eating speed" of each agent. If all agents are given the same eating speed, then the SE allocation satisfies SD-envy-freeness - a strong ordinal variant of envy-freeness (it means that the allocation is envy-free for all vectors of additive utility functions consistent with the agents' item rankings). This particular variant of SE is called the Probabilistic Serial rule (PS).[1]

SE was developed by Hervé Moulin and Anna Bogomolnaia as a solution for the fair random assignment problem, where the fraction that each agent receives of each item is interpreted as a probability. If the integral of the eating speed of all agents is 1, then the sum of fractions assigned to each agent is 1, so the matrix of fractions can be decomposed into a lottery over assignments in which each agent gets exactly one item. With equal eating speeds, the lottery is envy-free in expectation (ex-ante) for all vectors of utility functions consistent with the agents' item rankings.

A variant of SE was applied also to cake-cutting, where the allocation is deterministic (not random).[2]

Description[edit]

Each item is represented as a loaf of bread (or other food). Initially, each agent goes to their favourite item and starts eating it. It is possible that several agents eat the same item at the same time.

Whenever an item is fully eaten, each of the agents who ate it goes to their favorite remaining item and starts eating it in the same way, until all items are consumed.

For each item, the fraction of that item eaten by each agent is recorded. In the context of random assignments, these fractions are considered as probabilities. Based on these probabilities, a lottery is done. The type of lottery depends on the problem:

  • If each agent is allowed to receive any number of items, then a separate lottery can be done for each item. Each item is given to one of the agents who ate a part of it, chosen at random according to the probability distribution for that item.
  • If each agent should receive exactly one item, then there must be a single lottery that picks an assignment by some probability distribution on the set of deterministic assignments. To do this, the n-by-n matrix of probabilities should be decomposed into a convex combination of permutation matrices. This can be done by the Birkhoff algorithm. It is guaranteed to find a combination in which the number of permutation matrices is at most n2-2n+2.

An important parameter to SE is the eating speed of each agent. In the simplest case, when all agents have the same entitlements, it makes sense to let all agents eat in the same speed all the time. However, when agents have different entitlements, it is possible to give the more privileged agents a higher eating speed. Moreover, it is possible to let the eating speed change with time. The important thing is that the integral of the eating speed of each agent equals the total number of items that the agent should receive (in the assignment setting, each agent should get exactly 1 item, so the integral of all eating-speed functions should be 1).

Examples[edit]

There are four agents and four items (denoted w, x, y, z). The preferences of the agents are:

  • Alice and Bob prefer w to x to y to z.
  • Chana and Dana prefer x to w to z to y.

The agents have equal rights so we apply SE with equal and uniform eating speed of 1 unit per minute.

Initially, Alice and Bob go to w and Chana and Dana go to x. Each pair eats their item simultaneously. After 1/2 minute, Alice and Bob each have 1/2 of w, while Chana and Dana each have 1/2 of x.

Then, Alice and Bob go to y (their favourite remaining item) and Chana and Dana go to z (their favourite remaining item). After 1/2 minute, Alice and Bob each have 1/2 of y and Chana and Dana each have 1/2 of z.

The matrix of fractions is now:

Alice: 1/2 0 1/2 0

Bob: 1/2 0 1/2 0

Chana: 0 1/2 0 1/2

Dana: 0 1/2 0 1/2

Based on the eaten fractions, item w is given to either Alice or Bob with equal probability and the same is done with item y; item x is given to either Chana or Dana with equal probability and the same is done with item z. If it is required to give exactly 1 item per agent, then the matrix of probabilities is decomposed into the following two assignment matrices:

1 0 0 0 ||| 0 0 1 0

0 0 1 0 ||| 1 0 0 0

0 1 0 0 ||| 0 0 0 1

0 0 0 1 ||| 0 1 0 0

One of these assignments is selected at random with a probability of 1/2.

Other examples can be generated at the MatchU.ai website.

Properties[edit]

The description below assumes that all agents have risk-neutral preferences, that is, their utility from a lottery equals the expected value of their utility from the outcomes.

Efficiency[edit]

SE with any vector of eating speeds satisfies an efficiency property called SD-efficiency (also called ordinal efficiency). Informally it means that, considering the resulting probability matrix, there is no other matrix that all agents weakly-sd-prefer and at least one agent strictly-sd-prefers.

In the context of random assignments, SD-efficiency implies ex-post efficiency: every deterministic assignment selected by the lottery is Pareto-efficient.

A fractional assignment is SD-efficient if-and-only-if it is the outcome of SE for some vector of eating-speed functions.[1]: Thm.1 

Fairness[edit]

SE with equal eating speeds (called PS) satisfies a fairness property called ex-ante stochastic-dominance envy-freeness (sd-envy-free). Informally it means that each agent, considering the resulting probability matrix, weakly prefers his/her own row of probabilities to the row of any other agent. Formally, for every two agents i and j:

  • Agent i has a weakly-higher probability to get his best item in row i than in row j;
  • Agent i has a weakly-higher probability to get one of his two best items in row i than in row j;
  • ...
  • For any k ≥ 1, agent i has a weakly-higher probability to get one of his k best items in row i than in row j.

Note that sd-envy-freeness is guaranteed ex-ante: it is fair only before the lottery takes place. The algorithm is of course not ex-post fair: after the lottery takes place, the unlucky agents may envy the lucky ones. This is inevitable in allocation of indivisible objects.

PS satisfies another fairness property, in addition to envy-freeness. Given any fractional allocation, for any agent i and positive integer k, define t(i,k) as the total fraction that agent i receives from his k topmost indifference classes. This t is a vector of size at most n*m, where n is the number of agents and m is the number of items. An ordinally-egalitarian allocation is one that maximizes the vector t in the leximin order. PS is the unique rule that returns an ordinally-egalitarian allocation.[3]

Strategy[edit]

SE is not a truthful mechanism: an agent who knows that his most preferred item is not wanted by any other agent can manipulate the algorithm by eating his second-most preferred item, knowing that his best item will remain intact. The following is known about strategic manipulation of PS:

  • PS is truthful when agents compare bundles using the downward lexicographic relation.[3]
  • An agent can compute in polynomial time a best-response w.r.t. the downward lexicographic relation. When there are two agents, each agent can compute in polynomial time a best response w.r.t. expected utility. When the number of agents can vary, computing a best response w.r.t. EU is NP-hard.[4]
  • Best responses w.r.t. expected utility can cycle. However, a pure Nash equilibrium exists for any number of agents and items. When there are two agents, there are linear-time algorithms to compute a preference-profile that is in Nash equilibrium w.r.t. the original preferences. In some empirical settings, PS is less prone to manipulation. When an agent is risk-averse and has no information about the other agents' strategies, his maximin strategy is to be truthful.[4]
  • A manipulating agent can increase his utility by a factor of at most 3/2. This was first observed empirically on random instances,[5] and then proved formally.[6]

Note that the random priority rule, which solves the same problem as PS, is truthful.

Extensions[edit]

The SE algorithm has been extended in many ways.

  • Katta and Sethuraman[7] present Extended PS (EPS), which allows weak ordinal preferences (rankings with indifferences). The algorithm is based on repeatedly solving instances of parametric network flow.
  • Bogomolnaia[3] presented a simpler definition of the PS rule for weak preferences, based on the leximin order.
  • Yilmaz[8] allows both indifferences and endowments.
  • Athanassoglout and Sethuraman[9] present the controlled consuming (CC) rule, which allows indifferences and fractional endowments of any quantity.
  • Budish, Che, Kojima and Milgrom[10] present Generalized PS, which allows multiple units per item, more items than agents, each agent can get several units, upper quotas, and bi-hierarchical constraints on the feasible allocations.
  • Ashlagi, Saberi and Shameli[11] present another Generalized PS, which allows lower and upper quotas, and distributional constraints (constraints on the probability distribution and not only the final allocation).
  • Aziz and Stursberg[12] present Egalitarian Simultaneous Reservation (ESR), which allows not only fair item allocation but also general social choice problems, with possible indifferences.
  • Aziz and Brandl[13] present Vigilant Eating (VE), which allows even more general constraints.

Guaranteeing ex-post approximate fairness[edit]

As explained above, the allocation determined by PS is fair only ex-ante but not ex-post. Moreover, when each agent may get any number of items, the ex-post unfairness might be arbitrarily bad: theoretically it is possible that one agent will get all the items while other agents get none. Recently, several algorithms have been suggested, that guarantee both ex-ante fairness and ex-post approximate-fairness.

Freeman, Shah and Vaish[14] show:

  • The Recursive Probabilistic Serial (RecPS) algorithm, which returns a probability distribution over allocations that are all envy-free-except-one-item (EF1). The distribution is ex-ante EF, and the allocation is ex-post EF1. A naive version of this algorithm yields a distribution over a possibly exponential number of deterministic allocations, a support size polynomial in the number of agents and goods is sufficient, and thus the algorithm runs in polynomial time. The algorithm uses separation oracles.
  • A different algorithm, based on an ex-ante max-product allocation, which attains ex-ante group envy-freeness (GEF; it implies both EF and PO), and ex-post PROP1+EF11. This is the only allocation rule that achieves all these properties. It cannot be decomposed into EF1 allocations.
  • These combinations of properties are best possible: it is impossible to guarantee simultaneously ex-ante EF (even PROP) and ex-ante PO together with ex-post EF1; or ex-ante EF (even PROP) together with ex-post EF1 and fractional-PO.
  • The RecPS can be modified to attain similar guarantees (ex-ante EF and ex-post EF1) for bads.

Aziz[15] shows:

  • The PS-lottery algorithm, in which the allocation is ex-ante sd-EF, and the lottery is done only among deterministic allocations that are sd-EF1, i.e., the EF and EF1 guarantees hold for any cardinal utilities consistent with the ordinal ranking. Moreover, the outcome is sd-PO both ex-ante and ex-post. The algorithm uses as subroutines both the PS algorithm and the Birkhoff algorithm. The ex-ante allocation is equivalent to the one returned by PS; this shows that the outcome of PS can be decomposed into EF1 allocations.
  • With binary utilities, the PS-lottery algorithm is group-strategyproof, ex-ante PO, ex-ante EF and ex-post EF1.
  • These combinations of properties are best possible: it is impossible to guarantee simultaneously ex-ante sd-EF, ex-post EF1 and ex-post PO; or ex-ante PO and ex-ante sd-EF.
  • Checking whether a given random allocation can be implemented by a lottery over EF1 and PO allocations is NP-hard.

Babaioff, Ezra and Feige[16] show:

  • A polynomial-time algorithm for computing allocations that are ex-ante proportional, and ex-post both PROP1 and 1/2-fraction maximin-share (and also 1/2-fraction truncated-proportional share).
  • These properties are nearly optimal - it is impossible to guarantee more than PROP ex-ante, and more than n/(2n-1) truncated-proportional share ex-post.

Hoefer, Schmalhofer and Varricchio[17] extend the notion of "Best-of-Both-Worlds" lottery to agents with different entitlements.

See also[edit]

The page on fair random assignment compares PS to other procedures for solving the same problem, such as the random priority rule.

References[edit]

  1. ^ a b c Bogomolnaia, Anna; Moulin, Hervé (2001). "A New Solution to the Random Assignment Problem". Journal of Economic Theory. 100 (2): 295. doi:10.1006/jeth.2000.2710.
  2. ^ Aziz, Haris; Ye, Chun (2014). "Cake Cutting Algorithms for Piecewise Constant and Piecewise Uniform Valuations". In Liu, Tie-Yan; Qi, Qi; Ye, Yinyu (eds.). Web and Internet Economics. Lecture Notes in Computer Science. Vol. 8877. Cham: Springer International Publishing. pp. 1–14. doi:10.1007/978-3-319-13129-0_1. ISBN 978-3-319-13129-0. S2CID 18365892.
  3. ^ a b c Bogomolnaia, Anna (2015-07-01). "Random assignment: Redefining the serial rule". Journal of Economic Theory. 158: 308–318. doi:10.1016/j.jet.2015.04.008. ISSN 0022-0531.
  4. ^ a b Aziz, Haris; Gaspers, Serge; Mackenzie, Simon; Mattei, Nicholas; Narodytska, Nina; Walsh, Toby (2015-05-04). Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems. Istanbul, Turkey: International Foundation for Autonomous Agents and Multiagent Systems. pp. 1451–1459. ISBN 978-1-4503-3413-6.. Older technical report: https://arxiv.org/abs/1401.6523.
  5. ^ Hosseini, Hadi; Larson, Kate; Cohen, Robin (2018-07-01). "Investigating the characteristics of one-sided matching mechanisms under various preferences and risk attitudes". Autonomous Agents and Multi-Agent Systems. 32 (4): 534–567. arXiv:1703.00320. doi:10.1007/s10458-018-9387-y. ISSN 1573-7454. S2CID 14041902.
  6. ^ Wang, Zihe; Wei, Zhide; Zhang, Jie (2020-04-03). "Bounded Incentives in Manipulating the Probabilistic Serial Rule". Proceedings of the AAAI Conference on Artificial Intelligence. 34 (2): 2276–2283. arXiv:2001.10640. doi:10.1609/aaai.v34i02.5605. ISSN 2374-3468. S2CID 210943079.
  7. ^ Katta, Akshay-Kumar; Sethuraman, Jay (2006). "A solution to the random assignment problem on the full preference domain". Journal of Economic Theory. 131: 231–250. doi:10.1016/j.jet.2005.05.001.
  8. ^ Yılmaz, Özgür (2009). "Random assignment under weak preferences". Games and Economic Behavior. 66: 546–558. doi:10.1016/j.geb.2008.04.017.
  9. ^ Athanassoglou, Stergios; Sethuraman, Jay (2011-08-01). "House allocation with fractional endowments". International Journal of Game Theory. 40 (3): 481–513. doi:10.1007/s00182-010-0251-9. ISSN 1432-1270. S2CID 15909570.
  10. ^ Budish, Eric; Che, Yeon-Koo; Kojima, Fuhito; Milgrom, Paul (2013-04-01). "Designing Random Allocation Mechanisms: Theory and Applications". American Economic Review. 103 (2): 585–623. doi:10.1257/aer.103.2.585. ISSN 0002-8282.
  11. ^ Ashlagi, Itai; Saberi, Amin; Shameli, Ali (2020-03-01). "Assignment Mechanisms Under Distributional Constraints". Operations Research. 68 (2): 467–479. arXiv:1810.04331. doi:10.1287/opre.2019.1887. ISSN 0030-364X.
  12. ^ Aziz, Haris; Stursberg, Paul (2014-06-20). "A Generalization of Probabilistic Serial to Randomized Social Choice". Proceedings of the AAAI Conference on Artificial Intelligence. 28 (1). doi:10.1609/aaai.v28i1.8796. ISSN 2374-3468. S2CID 16265016.
  13. ^ Aziz, Haris; Brandl, Florian (2022-09-01). "The vigilant eating rule: A general approach for probabilistic economic design with constraints". Games and Economic Behavior. 135: 168–187. arXiv:2008.08991. doi:10.1016/j.geb.2022.06.002. ISSN 0899-8256. S2CID 221186811.
  14. ^ Freeman, Rupert; Shah, Nisarg; Vaish, Rohit (2020-07-13). "Best of Both Worlds: Ex-Ante and Ex-Post Fairness in Resource Allocation". Proceedings of the 21st ACM Conference on Economics and Computation. EC '20. Virtual Event, Hungary: Association for Computing Machinery. pp. 21–22. arXiv:2005.14122. doi:10.1145/3391403.3399537. ISBN 978-1-4503-7975-5. S2CID 211141200.
  15. ^ Aziz, Haris (2020-12-07). "Simultaneously Achieving Ex-ante and Ex-post Fairness". Web and Internet Economics. Lecture Notes in Computer Science. Vol. 12495. Berlin, Heidelberg: Springer-Verlag. pp. 341–355. arXiv:2004.02554. doi:10.1007/978-3-030-64946-3_24. ISBN 978-3-030-64945-6. S2CID 214802174.
  16. ^ Babaioff, Moshe; Ezra, Tomer; Feige, Uriel (2021-02-09). "Best-of-Both-Worlds Fair-Share Allocations". arXiv:2102.04909 [cs.GT].
  17. ^ Hoefer, Martin; Schmalhofer, Marco; Varricchio, Giovanna (2022-09-08). "Best of Both Worlds: Agents with Entitlements". arXiv:2209.03908 [cs.GT].