Every February, the State’s attention seems to turn toward the North Slope. People become interested in whether or not oil production has increased, and what the experts expect to happen next. Inevitably, this leads to questions about “fair share” and “competitiveness.”
People voice different interpretations of the data they are presented. People like me have to try to explain how a reduced decline rate is actually an increase in production. And then there’s the speculation about what caused the changes we see.
Eventually we start our annual debate about oil taxes. But of course our confirmation biases prevent us from hearing what others say. The conversations fill with hyperbole and rhetoric. The lobbyists and consultants flood into the Capital building, spinning the numbers to fit their talking points.
And the whole system is complicated enough that you can find data to support any position you have. You can focus in on a single variable to validate your beliefs, even if the larger picture washes out the findings. Or you can just torture the data until it confesses to the finding you want to be true.
In the end, the picture is so muddy that no one really knows what the data are saying. How can we know which tool to use if we don’t understand the problem?
That is what I am trying to help fix through this endeavor. I want us to have informed debate based on factual information. That’s what this blog is all about. No spin, no agenda, just the facts.
So let’s start by looking at the oil industry. There is no question that the oil industry is important to Alaska’s economy. It provides good paying jobs to thousands of people. Those paychecks help support the rest of the economic system. The tax and royalty revenues the industry contributes are what pay for almost all of the government services we enjoy. And the money that those resources generated in the past is what provides us with our dividend check each October.
The health of the oil industry is vital to our economy. And understanding what is going on in the industry must be a starting place for any broader conversation. That is why it surprises me that we don’t have a better handle on the data. And without a good understanding of the data, how can we speak intelligently about what those data mean? Why do we provide so little effort into understanding what is going on and what will happen next?
Right now, the only agency that provides a forecast of the oil production is the Department of Revenue (DOR). Yet, their data is confidential and their forecasts are weak. Historically, actual production has come in below their forecast 96% of the time.
While I helped to improve the situation back in 2012 (when I worked for DOR back then, I was part of the reason we introduced “risking” to the forecast), there is still work to be done. Now that DNR is helping generate the production forecast, it is a little better. But there is still room for improvement.
DOR also provides a price forecast, which combines with the production forecast to project future state revenues. As I said before, the oil price forecast is not very good either (ironically they tend to miss in the opposite direction).
The State relies too much on these forecasts for them to be unreliable. Especially now, as budget cuts, taxes, and reductions to the PFD are perennial issues that threaten government shutdowns.
We need an unbiased, non-partisan, third-party to intervene and interpret the data. Someone that can double-check the numbers, understand the issues, and inform the conversation. That’s the role I’m trying to fill.
But the problem isn’t just in the forecasts. It starts with the data. If we can’t agree on what history was, how can we possibly agree on what the future may be?
For example, look at the production data for the last 15 years from the various sources that provide it.
Let me help reconcile why these numbers are different.
In case you want to double-check my numbers, let me explain how I got these data. The DOR RSB figures come directly from its annual publications. DOR receives these data as part of tax filing requirements, and therefore the data are confidential in detail.
The DOR website also provides daily throughput numbers, which it receives from Alyeska every night. You can get them here.
The DNR data come from its website. These data are provided to fulfill the royalty requirements of the lease agreements. These data are very detailed and are not confidential. However, they only provide the detailed information back to 2012 on the website. This data is in a report format, so it’s a pain to use for anything else.
AOGCC has a fantastic data extraction tool you can access here. These data are provided to the Alaska Oil and Gas Conservation Commission at a very detailed level by the producers. These data are not confidential except in very specific circumstances.
And EIA publishes its numbers here, but they are delayed compared to other sources.
Reconciling Production Data
So what was production in 2017? These different sources provide answers ranging from 527,912 to 479,986 barrels of oil per day. That’s a 48,000 barrel per day difference between sources. How can we be certain if actual production rates differ from forecasts due to errors in forecasting or errors in reporting? And if you try to combine data sources (like using AOGCC data to forecast what DOR will report), good luck.
Fortunately, there are good reasons for the different data reporting. But, you need to understand these differences before you use the data.
AOGCC is the Source Data
It turns out, AOGCC is the most reliable data source. It is reported down to the well level, on a two month delay. And these data are publicly available. From what I can tell, these data are the source data for everyone else.
Let’s start with the EIA. This discrepancy was the most concerning to me, since it is consistently lower than the others. I dug around their disclaimers, made a few phone calls, and played with the AOGCC data until I figured it out.
The EIA uses the AOGCC production data, but does not include the production of Natural Gas Liquids (NGLs). They also report the numbers as calendar year averages.
Once I made these adjustments, the data line up almost exactly (any accountants out there might still wonder about the remaining difference. It bothers me too, but let’s just call it a rounding error).
|AOGCC FY||EIA||AOGCC CY without NGLs|
So, I don’t recommend you use the EIA data. It’s incomplete.
One down, 2 to go.
Let’s tackle the DNR question next. On its website, DNR makes available a production history file. That file is expressed in calendar year averages. When I compare those number to AOGCC numbers (also expressed in calendar year averages and including NGLs), they look to be very much the same. However, there is a 9,000 barrel difference in 2005 that cannot be explained. I believe this is a data entry error is one of the agency’s data (here is an example of why we need third-party corroboration).
|CY||DNR Prod (CY)||AOGCC (CY including NGLs)|
Production and Transportation Differences
The other file available on DNR’s website is titled “TAPS throughput.” Looking at the difference between throughput and production should be enlightening.
|CY||DNR Prod (CY)||DNR TAPS (CY)||Difference|
TAPS is the only way for oil to move off the North Slope. So, the difference between production and throughput should be the amount of oil that is used on the North Slope. There are only two ways this happens. First, a small amount of oil is refined at the topping plants at Kuparuk and Prudhoe Bay. Second, NGLs make for good enhanced oil recovery (EOR) fluids. When a field doesn’t have the native NGLs it needs to perform EOR operations, it gets them from somewhere else. For years, this was happening between Prudhoe Bay and Kuparuk.
Looking at the table above, the volume of produced oil that stays on the North Slope looks to be decreasing steadily as production declines. That is, until 2014. At that point, something changed. I happen to know what that change was.
According to the Petroleum News, ConocoPhillips converted the Oliktok Pipeline from NGL transportation to fuel gas in late 2014. That pipeline was previously delivering NGLs produced from Prudhoe Bay over to Kuparuk for miscible injection (a type of EOR). By the way, the article also says that the pipeline is returning to NGL transportation this month (August 2018).
Royalty and Production Report Variance
Next, we need to resolve the differences in the production report and the royalty report on the DNR website. This one is actually simple. The royalty report only includes State leases (and only goes back to 2012).
|CY||Royalty Report||Production Report||Difference|
The difference is production from Federal and ASRC lands. The increase in 2016 is production from CD5.
Now we are just left with the DOR numbers.
Unfortunately, DOR does not provide monthly data. So, let me convert the AOGCC data to fiscal year and see how DOR compares. We already saw that AOGCC data is the source data for EIA and that is matches the DNR reports. So it should line up with DOR too.
These differences look familiar. It appears that DOR does not count reinjected NGLs as production. So, if you want to use AOGCC data to forecast DOR production numbers, you need to account for reinjected NGLs.
DOR Transportation and Production Variances
Ok, one last item. How close are the daily run tickets that Alyeska provides to the final production numbers that DOR reports in its annual publication? Since the DOR numbers pull the reinjection numbers out, they should be close. Let’s check.
It looks like the daily throughput reports are a very reliable indicator of the DOR production numbers (other than an anomaly in 2008). This means that you can use the daily data to update your forecasts in real-time without corrupting your data.
Data can be tricky. While there are very good reasons for the differences in the data, we need to be careful when we use them. Here are a few recommendations from me.
First, I recommend that we all use AOGCC production data when we talk about Alaska oil production. This is going to become a more serious issue as production from GMT, Willow, Liberty, and ANWR start to flow.
We need to agree on what counts as “production” when some is taxable, some is royalty bearing, some is neither, and some is reinjected into the ground. We are going to need a way to differentiate those production types without confusing the conversation.
Second, be careful to ask what the data mean whenever it is presented to you. Comparing fiscal year and calendar year data is a common mistake that can lead to spurious conclusions. Comparing different data sources can lead to similar confusion.
And finally, let’s get a better handle on the data we are using to make important policy decisions. I’ll be here trying to help, but only for as long as I can afford to do it. If you want to keep me working on things like this, consider making a donation to the cause.
2 thoughts on “Reconciling Oil Production Data”
Appreciate this information Ed. I’ve wondered about this.
Is it safe to assume Alyeska measures throughput at the TAPS terminus? GVEA has a pipeline in Fairbanks that connects to TAPS for a local refinery (or two when they are both operating). That would, I assume, account for part of the difference between production and throughput. Thanks again.
Alyeska sends a report out a midnight every night that includes the Pump Station 1 and the Valdez terminal volumes. I’m not sure which number DOR publishes, but I believe it is the PS1 number. You are correct that a few thousand barrels come off at the refinery in North Pole, but I’m almost positive those numbers are included in the production total. .