Archives for January, 2010

After the New York Times announced its metered paywall last week there has been a lot of empty blather. Standing out from all the noise are two very good analyses. The first was by Felix Salmon for Reuters, analyzing a consumers decision of whether or not to pay. The second one was by Jonathan Stray on Nieman Lab, showing the effect of several different variables on revenue.

This stuff is right up my alley, and I’m currently working on a senior thesis in the field and so I’ll try to extend Salmon’s analysis a little bit. Later on, I’ll take on Stray’s model as well.

Salmon’s Analysis

Let’s say a reader in a given period reads N articles from the New York Times. Then suppose the New York Times sets the paywall after a consumer has read some n<N articles. In order to read the n+1th article, the reader must pay a fee of F. If v is the value the reader gets from each article, then he will only pay the fee if v\left (N-n  \right ) > F. This is a good simple model synopsis.

Article Values are Different

Let n,N,F be as before. The first issue that jumps out is that the value of any given article is not constant. The value of articles over a period varies, so let’s arrange them in order of value from highest to lowest.

Let \{v_i\}_{i=1..\infty} be a monotonically decreasing sequence of article values for our reader, with v_i = 1 \:\forall\: i>N. Then the reader gets value,

u(v)=\left\{\begin{matrix}\left (\sum_{i=1}^{N}{v_i}  \right ) - F &if\;\sum_{i=n+1}^{N}{v_i} > F\\ \sum_{i=1}^{n}{v_i} &if\; \sum_{i=n+1}^{N}{v_i} \leq F \end{matrix}\right.

The reader would clearly choose to read the articles he values most first, and after that only pay the subscription if the rest of the articles he has yet to read are still valuable enough. Only if \sum_{i=n}^{N}{v_i} > F will the reader pay the fee.

But this is not quite right either. There’s no way for a reader to know ahead of time which articles are most valuable to him.

Predicting future value

Now, instead of ordering the values of articles from highest to lowest, let’s say that the value of articles our reader reads are drawn independently from a probability distribution. Let the value of articles be a random variable V \sim N\left ( \mu,\: \sigma^2 \right ) with a normal distribution and \mu_x the average value of an article. V_1, V_2, V_3,\cdots are the value of the first article read, second article read, etc.

Let the period of time for which the reader pays be represented as \left [ 0,1 \right ], and the moment when the reader has read n free articles and must choose whether or not to pay the fee be at time t\in \left [ 0,1 \right ]. Assume the reader reads articles at some constant rate r throughout the entire period. Then t= \frac{n}{r}.

Now the reader must predict what the value of articles he will read will be to determine whether or not he should pay the fee. Up to point t, he has gotten value \sum_{i=1}^{n}{V_i} and average value per article of \overline{V}= \frac{\sum_{i=1}^{n}{V_i}}{n}. \overline{V} is also the sample mean of the distribution.

Result

Our reader will choose to pay the fee if \left ( 1-t \right ) r \frac{\sum_{i=1}^{n}{V_i}}{n} > F. As r goes up, so does F and as n goes up, F goes down.

There are some interesting suggestions from this. When the New York Times imposes the paywall, they should carefully monitor the rate at which people read its articles. Those that have a low rate would be ideally suited for targeted discounts. Also, since readers make their predictions based on past articles they’ve read, the ideal time to convert non-paying readers is right after a reader reads a series of good articles. If the Times can be subtle about dialing up and down n, then they can exploit variance in article value to increase sales.

Further work

This analysis is of course still incomplete. Problems I still see with it.

  • Knowing that you’ll only get a limited amount of articles for free will change a reader’s behavior. If they’re still uncertain about whether or not paying the fee will be worth it, they will more carefully pick which articles they read before time t. This will bias \overline{V} upwards, but push r downwards. At time t, there will also be a back-log of articles that would have been read but weren’t influencing the decision of whether to pay F or not.
  • How will the reader decide whether or not to read an article before time t? He’ll have to depend on the headline and a summary if available to make a prediction. Before actually reading the article, the reader will predict some value V_{i}' and after reading the article realize some value V_i. This average spread \frac{\sum_{i=1}^{m}{V_i-V_{i}'}}{m} will likely affect predictions of future value.
  • As is, the model says decreasing n and increasing F leaves the reader’s decision of whether to buy unchanged. But as n\rightarrow 0 this becomes a strict paywall, which the gut says people would be less willing to pay for. Another factor in the reader’s decision of whether or not to pay is their confidence about their decision. The larger n is the more confident they will be about their value prediction since the sample mean’s standard deviation will fall, as \overline{V} \sim N\left ( \mu,\: \frac{\sigma^2}{n} \right ).
  • Paywalls, as described by the New York Times and as currently implemented by the Financial Times and WSJ, are easily bypassed. This can be done either by spoofing the referrer header, or by clearing cookies. This avoidance could also be modeled in in some way.
  • Letting people in for free if they come via social media or links from other sites screws everything up. I think this may turn out to be such a huge gaping hole in the paywall that they severely restrict it, but if they don’t there are several ways it can be modeled.
    You could divide articles between different distributions of those that are primarily found through social media and those that aren’t. The reader would choose whether or not to pay based on the value of those that aren’t. Alternately, an article’s ability to be found through social media could just affect its V_i.
  • Print subscribers get free access as well. In Salmon’s post he looks at P-F, the difference between print subscriber’s fee and online subscribers. If this is less than the value of getting the print paper then the reader will choose the print subscription.
  • What if users can choose between a short period, and a longer period with a discount? What does the renewal decision look like?

There are undoubtedly more things that can be done with this model. One of the most obvious is to try and figure out what n and F should be set to.

Finding good values for F and n

Since it’s reader’s will not have the same distribution for V it would be theoretically ideal to pick values for n and F individually for every reader. Realistically, the New York Times probably shouldn’t be that opaque about their pricing as it would cause confusion and a negative reaction among readers.

If forced to pick a single price, it would be necessary to find the average value of articles for all readers. That’s what Stray did with his paywall simulation. However, part of the reason that simulation has such wild swings in revenue from relatively small changes is because many of the variables are dependent on each other. For example, the percentage of people who pay for a subscription does not stay constant when n or F change.

I’ll tackle this issue more in my next post.

Special Bonus! A pricing algorithm for the FT

This part might still be a bit half baked, but working backwards from the consumer’s decision, it seems possible to figure out a demand curve for each individual piece of content if enough data is available. Since the Financial Times already has a metered subscription plan, if they’ve been good about collecting user data they should have what’s necessary to do this. Here’s an outline of the method.

It requires some change of notation from the above.

Let a_i \in A \;\forall i\in\mathbb{N} be an article, and x_i \in X \;\forall i\in\mathbb{N} be a reader. We will now represent the value of an article to a reader as a mapping V: A\times X \mapsto \mathbb{R} with V(a_i,x_i) representing to the value of article a_i to reader x_i. The functions F(x_i) and r(x_i) replace F and r as the fee and rate for reader x_i. n is as before.

Define the set R(x_i) such that a_i \in R(x_i) \textsl{ iff } x_i reads a_i before deciding whether or not to buy.

So our former equation \left ( 1-t \right ) r \frac{\sum_{i=1}^{n}{V_i}}{n} > F becomes \left ( r(x_i)-n \right ) \frac{\sum_{a_i \in R(x_i)}{V(a_i,x_i)}}{n} > F(x_i).

Rearranging, we get \frac{\sum_{a_i \in R(x_i)}{V(a_i,x_i)}}{n} > \frac{F(x_i)}{r(x_i)-n}.

The left side of the above equation is the average value of an article that a reader reads before making the buying decision. So if x_i does buy a subscription, we then know that the average value was at least the right side.

Now that we have an estimate of a given readers average value for content we want to estimate that value across all readers. For any given piece of content, some fixed a_i, to determine its value we sum the average value for content of all readers who read a_i before purchasing, and then divide by the total number of readers (who aren’t already subscribers) who’ve read a_i.

Define, \overline{V(x_i)}=\begin{Bmatrix}\frac{F(x_i)}{r(x_i)-n} &,\:if\: \frac{\sum_{a_i \in R(x_i)}{V(a_i,x_i)}}{n} > \frac{F(x_i)}{r(x_i)-n}\\ 0 &,\:if\: \frac{\sum_{a_i \in R(x_i)}{V(a_i,x_i)}}{n} \leq \frac{F(x_i)}{r(x_i)-n}\end{Bmatrix}.

Equivalently, \overline{V(x_i)}=\begin{Bmatrix}\frac{F(x_i)}{r(x_i)-n} &,\;\text{if x buys}\\ 0 &,\;\text{if x does not buy}\end{Bmatrix}.

This function \overline{V(x_i)} is an estimator of the average x_i has for an article.

Now define the set S(a_i) such that x_i \in S(a_i) \textsl{ iff } x_i reads a_i before deciding whether or not to buy a subscription. This set is all non-subscribing readers that read article a_i in the current period, whether or not they’ve ultimately paid for a subscription by the end of the period or not.

If we take \overline{V(x_i)} for each x_i in the set S(a_i), we have a distribution of estimated values for article a_i. That might look something like this.

Article values

Finally, to come up with a set value for a specific piece of content, we sum over the entire set and divide by the number of readers.

P(a_i)= \frac{\sum_{x_i \in S(a_i)}{\overline{V(x_i)}}}{\left | S(a_i) \right |}

With this value, you can now derive a demand curve for the entire site. Or you can dynamically set prices based on what articles a reader has viewed before hitting the paywall.

Exciting stuff, if actually implemented.

If you think I’ve screwed up the math in some way, or if anything isn’t clear, please please let me know. The thoughts in this post are still very much a work in progress.


Hope everyone has had a good holiday! Did you buy gifts for people? Or receive them? From a pure economic efficiency perspective you shouldn’t spend a second on holiday shopping and just give those around you cold hard cash. After all, lump sum transfers are most efficient in redistributing wealth, and there’s no way you could know a recipients preferences better than they do. (Exceptions for parents who’ve received letters to Santa of course)

Unfortunately this is empirically very unpopular (in the US at least). As Greg Mankiw explains, it’s about signaling.
Social interactions aren’t just about exchanges of economic value, they’re an intricate dance of signaling to others, posturing to others, projecting an image, fitting in with the cool crowd, and all manner of things a high-schooler would be embarrassed to admit to.

The content we consume becomes part of that intricate social game of posturing and positioning that we all play to navigate our social spheres. And luckily for content publishers, this all can be exploited.

Social Content

2009 may be remembered as the year that Facebook and Twitter topped the search engines in sites’ referrer logs. Increasingly, people are turning back to their friends and acquaintances to point them to accurate information, interesting news and entertainment. Search has become the victim of spam and gaming of algorithms, leading to online social spaces becoming a more reliable way of finding what is good online. Fred Wilson gives a clear summary.

For me, social has become the definitive content discovery mechanism. As this happens, content begins to look less and less like a commodity. In search engine land, content is ruthlessly and algorithmically ranked and rated in value and relevance. No one has a personal relationship with a search engine.

Information with social context becomes part of our relationships with other people. And no one is better at persuading us to do things we might otherwise not do than our friends.

For more evidence that social interactions destroy rational economic thinking, one need look no further than luxury goods that people consume and enjoy more simply because they are expensive and can signal to others that they are of high status. Demand for these goods actually rises as they become more expensive. Or take the vicissitudes of fashion, which require people to swap out their wardrobes on a continuous basis in order to convey their social status to others.

Applications

How to tap into this economic irrationality? Get people to consume goods in a social space that displays their consumption to others. People do this on the subway by holding open the covers of their magazine or book. People do it online by sharing links to their favorite sources and stories.

More specifically. Publishers could erect a paywall on a site that takes advantage of these social tendencies by giving paying subscribers the ability to share content by whisking their friends and followers past the paywall. With customized links, or a custom link shortening tool that meters the number who use the link it would be possible to set up multiple tiers of sharing and charge more to let someone share with more people.

This also lets people band together and buy subscriptions. It sounds bad for the bottom line, but in the economic literature, its been shown that allowing this kind of sharing, can actually improve profits for a content distributor. What causes this counter intuitive result? In a paper by Bakos, Brynjolfsson and Lichtman, they explain that this kind of social sharing works to aggregate widely distributed consumer valuations for content and present a more favorably shaped demand curve to the producer.

Content producers continue to struggle with monetizing their products online, and it looks increasingly unlikely that advertising will be the solution because content sites operate at the level of “intent generation” not “intent harvesting”. Perhaps by making the consumption of content more of a social experience, producers will have more success.