Loading [MathJax]/jax/output/CommonHTML/jax.js

A Toy Model of College Admissions

Allen Sirolly / November 27, 2020


The chart below from a 2019 Pew Research article caught my attention the other day. It shows a concerning trend of increasing selectivity at colleges which has been written about in a number of other places as well.1 The article reports that the average number of applications per enrollee increased from about four in 2002 to 6.8 in 2017, with the increase likely even more severe for competitive colleges. While this doesn’t necessarily make it “harder” to get into these schools (it could just be that more students who would have been non-competitive in previous years are applying), the rise in the average number of applications per applicant can certainly bring with it a concomitant increase in anxiety for successive waves of prospects.

My goal is to build a model for a multi-period college admissions market which, through some mechanism, gives rise to a vicious cycle of increasing application volume and lower admission rates. This is a big task which I don’t expect to fully accomplish here; instead, I’ll use this post to sketch out a simple “toy model” which might serve as a starting point. The main idea is that applicants shrink their beliefs about their chances of admission toward a public signal, such as last year’s admission rates, in such a way that the resulting admission rates will be slightly lower this year. I’ll start by considering a one-period model where applicants solve a utility maximization problem.

The Model

Colleges

Suppose there is an ordered set of colleges K indexed by k=1,,K. Each has capacity Ak for an entering freshman class. Colleges receive reports of ˜Wi (a noisy signal of applicant i’s ability/“type” Wi) from all applicants and must decide whom to admit based on capacity and expected yield. Colleges prefer applicant i to applicant j if ˜Wi>˜Wj, and applicants prefer college k to k if k<k.

To compute the results of an admissions round, I pretend that decisions are made sequentially and publicly, such that college k observes the admittances of all higher-ranked colleges (k<k). The admissions process follows a simple algorithm2:

  • The top-ranked college sorts its applicants by ˜Wi (descending) and admits the top students up to capacity A1 (all of whom match with it, for a 100% yield);

  • The second-ranked college sorts its applicants by ˜Wi (descending) and admits all who were admitted by the first-ranked college, plus the top students who were not admitted by the first-ranked college, up to capacity A2;

  • And so on for k=3,,K.

The embedded assumptions are that (a) colleges do not have minimum requirements for ˜Wi and (b) colleges will not reject applicants for being “overqualified”3 (in the sense of having a ˜Wi higher than another admit).

Applicants

The choice problem

Each applicant i has utility function ui(k) over K and faces a constant application cost c. Each has an ability (“type”) Wi and beliefs—shaped by priors and public information—about the probability of acceptance to college k given Wi. The problem the applicant faces is to choose a set of colleges Ci to which to apply; from the subset of admittances, the applicant will choose the college which maximizes utility. Assume that the utility of not being admitted to any college is normalized to 0, with ui(k)>0 for all k.

Let P(S) denote the power set of S, so that CiP(K) are the possible application/choice sets4 and SP(Ci) are the possible sets of colleges to which the applicant is admitted. Formally, an applicant will solve

maxCiP(K){SP(Ci)Pi(S)maxkSui(k)c|Ci|}

where Pi(S) is the probability that i is admitted to colleges in S and rejected from colleges in CiS. Applicants believe that admissions decisions by colleges are independent, so that

Pi(S)=kSP(admit at kWi)kCiS(1P(admit at kWi)).

P(K) is potentially a very large search space; I simplify the problem by using a greedy algorithm which at each step (starting from Ci=) adds the college k to Ci which yields the highest increase in expected utility, provided there is one. (update 12/12/2020: here’s a proof that this algorithm finds the optimal C(W).) Note that by adding k to Ci, an applicant’s expected utility will change by

P(admit at kWi)[SP(Ci)Pi(S)max{0,ui(k)maxkSui(k)}]c

This expression can be simplified by considering each “best option” in Ci to which i may be admitted:

P(admit at kWi)[Pi()ui(k)+kCiPi(k is best option)max{0,ui(k)ui(k)}]c

The corresponding probabilities are easy to compute by the independent admissions assumption.

Beliefs

Suppose that types are distributed WG(w). Each applicant knows her own type as well as the distribution G, and sends colleges a noisy signal ˜Wi=Wi+ϵi, ϵiF(ϵ), where F is known but the realization of ϵi is unknown. I assume that prior beliefs are consistent with a view of the admissions process as a perfect sorting mechanism, such that the jkAj applicants with the highest ˜Wi’s will be admitted to the k highest-ranked colleges. (This actually agrees with the outcome of the colleges’ decision process if every prospect sends an application to every college, which is what happens when c=0.5)

Let ˜G be the marginal distribution of ˜Wi (which applicants can calculate using knowledge of F and G). After observing public information Ik, e.g., last year’s admission rate at college k, the posterior beliefs are

Pα(admit at kWi)=(1α)P(1˜G(˜Wi)<jkAjjAj)+αIk

where α[0,1] represents the degree of shrinkage. There are certainly other possibilities: maybe it’s more realistic that applicants only update their beliefs to make themselves overconfident about their chances at more selective colleges.

One challenging problem (which I won’t attempt in this post) is to figure out the beliefs and utility functions which result in an admissions outcome that is consistent with {Ik}, in an equilibrium sense, for the c>0 case.

Simulation

The features of this model can be studied via a simulation exercise, in the spirit of agent-based modeling. I’ve made the code available on my GitHub.

For the simulation I construct a set of K=50 colleges, each with capacity Ak=100. There are kAk=5000 prospective applicants with

WiN(0,12)˜WiWiN(Wi,0.12)

so that ˜WiN(0,1.12). The public admission rates are

Ik=(1k1K1)r_+k1K1¯r

with (r_,¯r)=(.05,1), which corresponds to Ik=jkAj/jAj in this example. The utility function (common to all applicants) has the form

ui(k)=Iβk+γ(Kk)

for β>0 and γ>0. The latter parameter guarantees that there is a minimum increment in the utility function between consecutively-ranked colleges. I normalize the utilities so that kui(k)Ik=1000.

I computed the “results” of this artificial admissions process for a grid of parameter values, which led to some interesting charts:

Set of utility functions considered.
Applicants’ beliefs about probability of acceptance at each college, for different values of (Wi,α). Dashed line is the public information Ik.
Application volume by college rank for γ=3 and different values of (α,β,c).
Acceptance rates for γ=3 and different values of (α,β,c). Dotted line is Ik.
Yields for γ=3 and different values of (α,β,c). Dotted line is Ik.
Average number of applications per applicant by ability/type percentile, for γ=3 and different values of (α,β,c).
Total number of applications to each college, by applicant ability/type percentile, for γ=3, β=1.5, and different values of (α,c).
Number of admittances to each college, by applicant ability/type percentile, for γ=3, β=1.5, and different values of (α,c).

That’s it for now! More to come in a future post…


  1. I tweeted about it, too.

  2. This is similar to what’s known as a “serial dictatorship mechanism”, the difference being that there is still an application process that forces students to make decisions about which colleges to apply to.

  3. A richer model might incorporate incentives for colleges to keep their admission rates low.

  4. In practice, applicants will face an application limit L so that the choice set is Ci{CP(K):|C|<L}. The Common App presently limits students to 20 applications.

  5. The acceptance rate for college k will be jkAj/jAj, and the yield Ak/jkAj.