Friday, September 23, 2016

Federal 2PP Aggregate Methods (45th Parliament)

This is the methods post for the 45th Parliament edition of the aggregate (sometimes nicknamed "Aggedor") that I post in the sidebar of this site, and which will form the basis for Poll Roundup posts and, later, my attempts to forecast the next election.

The current version is essentially the same as the version running at the end of the 44th parliament, with some minor changes to the weightings and the interpretation of Newspoll.  One substantial methods change was made half-way through the 44th parliament, which was to switch from just using the 2PP figure supplied by pollsters using previous-election preferences, to using a hybrid of the supplied figure and a figure calculated from their primaries.

A simpler version ran before the 2013 election and fluked getting the 2PP exactly right as a result of a preference shift cancelling out a possible late swing to the Coalition.  The 2013-6 version had a final error of 0.4 points, almost half of which resulted from slight shifts in preference flow patterns.

The aggregate is mostly a weighted average of two-party preferred polling derived from all recent polls of sufficient standard.  The 2PP figure assigned to each poll is multiplied by various weightings based on the poll's recency, accuracy and other issues, and the sum of the multiplied poll scores is divided by the sum of the weightings.

The aggregate is designed to be transparently checkable in theory and to use basic mathematics only.  However, it is not entirely codified in advance.  Decisions will be made on issues of pollster weighting and house effects, and possibly other matters, and will be updated to this page at the bottom when made.



Pollsters Included:

Polls by the following pollsters are included when available: Galaxy (including Newspoll), ReachTEL, Ipsos, Essential,. Morgan.  Other pollsters may be added subject to their completion of three polls of national 2PP voting intention by the same methods, and adequate methods documentation, and previously active polls may be reincluded.  Even if a poll is by an included pollster, the poll is not included if it is an "internal poll" (defined as a poll commissioned by a political party, union, business or lobby group).  Polls that are known to include aggregations of polls by other pollsters are rejected.

One Nation Adjusted Figure:

The headline figure of the aggregate is a purely last-election based method because last-election preferences have a very strong record of historic accuracy.  However, the One Nation preference flow at the 2016 election was aberrant, based on a very small number of seats contested.  The sidebar figure therefore includes a preference estimate assuming a different flow from One Nation.  The different flow is based on an average of preference flows from the two federal elections at which One Nation most recently polled a significant lower house vote (1998 and 2001), the preference flows from the two state elections at which One Nation has most recently polled a significant lower house vote (Queensland and Western Australia), and the Longman by-election.  The assumed preference flow in the One Nation adjustment is 60.1% to Coalition.  The assumed One Nation primary at any time is the average of what it was in the last three Newspolls/Galaxys.

2PP calculations:

For every poll, a 2PP is calculated from the primaries, using last-election preferences.  For instance,

Coalition 2PP = Coalition Primary + .181*Green Primary + .508*Others Primary - 0.13

or

Coalition 2PP = Coalition Primary + .181*Green Primary + .398*NXT Primary + .505*One Nation Primary + .529* Others Primary - 0.13

(The formula used depends on whether the poll publishes breakdowns for NXT and One Nation or not.  The -0.13 is an adjustment for the impact of three-cornered Liberal/National contests).

For the United Australia Party, 2013 Palmer United Party preferences (.537*UAP to Coalition) are used.

If the poll does not release a 2PP based on preferences from the previous election or recent elections, then the above figure alone is used, rounded to one decimal place.  This also applies when the pollster was using figures from the election before last (eg polls using 2013 preferences in July-August 2016). I also reserve the right to apply it if a pollster's 2PP calculations are persistently suspicious (in which case I will log it on this page).

If the poll releases a last-election 2PP but the primaries have a greater level of detail, then the above figure alone (rounded to one decimal place) is used, provided that it rounds to the same 2PP.  If it doesn't then the closest figure that so rounds is used.

For other polls except Newspoll and Ipsos, the 2PP that is used is the average of:

1. The last-election 2PP released by the pollster
2. The calculated 2PP based on the released primaries (formula above).

This is rounded to one decimal place, with the constraint that the maximum allowed difference between the number thus found and the pollster's released 2PP (prior to applying any house effect correction) is 0.2 for Morgan, and 0.4 for all remaining pollsters.  So if a Newspoll released 2PP is 51, the aggregated 2PP will be in the range 50.6-51.4.

For Galaxy/Newspoll specifically, the following was applied for polls from December 2017 to mid-April 2019.  The average difference between the last-election 2PP calculated from the primaries and the published 2PP for the previous ten Newspolls (as of April 2019 this is 0.6 points) is added to the Coalition 2PP.  Then the average above is found (subject to the above constraints) and then the average difference is subtracted.  Thus for the late April Newspoll (published 2PP 49 for Coalition, last-election 2PP 47.6 for Coalition, I add 0.8 to the last-election 2PP to get 48.4, average that with 49 to get 48.7, and deduct 0.8 to get 47.9, which is the aggregated value.)

New Galaxy/Newspoll method: as of late April 2019, Galaxy/Newspoll started applying new preferencing assumptions for United Australia Party.  As there will not be enough time to measure preference differences under these new assumptions, the following is now applied:  The 2PP for the primaries is found by the calculated 2PP method for other polls, but with both One Nation and UAP assumed to flow 60% to Coalition.  This is then averaged with the last-election 2PP released by the pollster, and the difference between the calculated method with One Nation and UAP set to 60% and with them set at 50.5% and 53.7% respectively is subtracted.

Ipsos uses "batched" last-election 2PPs which assume preferences from all minor parties will flow the same way overall, even if the ratio of Greens to other minor parties changes.  For Ipsos specifically I use a weighting of 40% for the last-election 2PP and 60% for the calculated 2PP.  The maximum allowed difference constraint does not apply for Ipsos. Ipsos also releases respondent preferences, which are ignored.

Time weighting:

The aggregate is designed to be fairly cautious in adjusting to apparent changes, in case these are actually caused by sample noise. The base time weighting is 5 for a poll released this week, 3 for last week, 2 for the week before last, 1 for the week before that.   At present a new polling week is considered to start from Saturday with the reset being applied from midnight Friday.  Intermediate weightings of 4 for a poll bridging this and last week, 2.5 for a poll bridging one and two weeks ago, 1.5 for 2-3 weeks ago, 0.5 for 3-4 weeks ago, are used in the case of polls that use rolling data for two weeks (a la Essential) or that derive their data entirely from the middle or end of the previous week (eg ReachTEL often polls on Thursday.)

So for instance in the typical week the new Essential carries a time weighting of 4 when it is released, but this is reset to 2.5 when the next polling week starts.  A ReachTEL released on Thursday or Friday based on polling on Wednesday or Thursday is weighted at 5 but this is reset to 4 when the next polling week starts.

During election campaigns, in the last three weeks a weighting of 8 is applied for polls based entirely on data sampled (not released) in the last four days.  In the last week a weighting of 10 is applied for polls based entirely on data sampled (not released) in the last two days.  Only the most recent poll by any pollster can qualify for these weightings.

Number of polls included:

Only the two most recent polls by any pollster are included.  Where a pollster uses an overlapping method, such that part of the same sample appears in two different polls, only the two most recent completely independent results are included.  (eg under Essential's former two-week rolling method, the polls included would have been the current poll and the poll before last.)

Quality weighting:

Pollsters are weighted based on my view of their performance - not just in final polls but through campaign cycles and between campaigns.  I reward polls for consistently using the same methods with good results at multiple elections, and for being the best pollster at the most recent election.  Polls can be marked down for irregular tracking behaviour, for consistently having the vote of a particular party too high, for changing methods or not conducting polls very close to an election, and so on.  Based on experience in the previous cycle, I have increased the spread of ratings for this one.

The maximum weighting for a poll that has appeared fewer than eight times is 0.8.

The current quality weightings are:

1.3 YouGov Galaxy, Newspoll
1.10 ReachTEL
0.85 Essential
0.8 Ipsos
0.3 Morgan

If only the 2PP is released for a poll and not the primaries then the quality weighting is halved.

A quality weighting of 0.6 was applied for the YouGov-Fifty Acres series, which ceased from late November 2017.

For my full review of pollster performance at the 2016 election see here and for the 2013 election see here.

Respondent vs Last-Election Preferences:

Most recent elections have shown that last-election preferences are the most reliable method while respondent preferences skew to Labor and create needlessly volatile poll readings.  The 2016 election was no exception to this. Therefore this aggregate ignores respondent preferences.  If a pollster publishes only respondent preferences and primaries, I will calculate my own last-election 2PP from the primaries. If a pollster releases only respondent preferences and no primaries, the poll may be aggregated at one-third weighting, but only if the poll has satisfied due diligence, meaning that they have answered all my questions about how they are doing respondent preferencing and I am satisfied with their answers.  An example of a practice that does not satisfy due diligence is asking Nationals voters for a second preference - because the vast majority of Nationals voters live in electorates where their candidate makes the final two, so very few Nats votes are distributed.

House Effect Correction:

Poll types by a specific pollster are applied a house effect correction when it is considered that they are displaying a house effect in favour of a particular party.  This can either be a systematic long-term feature of that poll series or can reflect a pollster developing a sustained medium-term difference from the tracking shown by other polls.   The minimum house effect correction is 1 point, so if a pollster is suspected of having a house effect smaller than that, then that is treated under Global House Effect (see below).

The current house effects applied are:

No polls have current house effect corrections.

Global House Effect Correction:

A global house effect correction of up to a point is applied to the aggregate if it is considered that the pollsters included in it (after the adjustments above) tend on average to lean to one side or the other.  This is partly determined by considering the performance of polls at a range of election campaigns, with federal elections considered most important but state elections also taken into account.  However short-term changes in house effect in specific polls are also taken into account. The frequency of the polls included is also taken into account - if polls that lean slightly to one party are more common or more heavily weighted than those leaning slightly to the other, then this is corrected for.

The current global house effect correction is +0.2 added to Coalition.

Sample size:

No scaled adjustment is made for sample size at present (since most reliable national polls use broadly similar sample sizes which they increase at election time), except that a poll with a sample size of less than 950 automatically has its value in the aggregate halved.

Provisional vs Final Readings for a poll:

Sometimes only some of the results of a poll (such as the 2PP but not the primaries, or not all of the primaries) are available for a short amount of time.  In these cases I will often post a provisional update for that poll based on what is available and then post a final update when all data are available.

Unusual polls:

From time to time a poll is available that includes data from around half the Australian voter base.  Examples are a Galaxy poll of only female voters, and a ReachTEL poll of only NSW and Victorian voters. In these cases a national figure is estimated based on known differences between the surveyed and non-surveyed components, and the poll is aggregated at half its normal value.  However such polls by pollsters without well-established national 2PP tracking (eg JWS Research bulk polls of dozens of "marginal seats") are not included.  Single-state polls of federal voting intention are also not included.

New poll types:

A new poll type by an established pollster may be either included immediately, or excluded until three polls by that type have been conducted.  The new poll type is always excluded until three polls by that type are conducted if it appears to have a substantial house effect.

Example tracking:

The following is example of tracking for the current parliament four weeks after the condition for using the aggregate was first met:


Labor has led throughout this period (and by 2016 preferences on every poll released since the election), although by my assessment this lead is relatively narrow.  The aggregate more heavily weights Newspoll than Essential and assumes both usually lean slightly to Labor based on form in the previous parliament, although neither did so in their final poll.

Here is the final output of the smoothed aggregate for the previous parliament:



Seat Projections and Interpretation:

Formal seat projections will be added in the leadup to the an election.

For the time being, while the Mackerras pendulum has Labor requiring a 50.7% 2PP based on uniform swing to win more seats than the Coalition at the next election, it really shouldn't be that hard.  Labor has a massive advantage in personal votes arising from its many seat wins at the 2016 election.  This may well be blunted by redistributions (or if any of the new MPs don't recontest), but for now I estimate Labor needs a 2PP of only 50.1% for a 50% chance of winning more seats than the Coalition (ignoring any possible crossbench changes).  It is similar to the situation at the 2001 election, at which a 1.93% swing in its favour gave the Coalition not the eight seat gains implied by the pendulum, but only two.

Update April 2018: The draft redistribution, should it be finalised without significant changes, appears to make Labor's task even easier.  Pending the finalisation of the redistribution (after which I can make seat-by-seat changes to personal vote shares based on it) I estimate Labor now needs a 2PP of around 49.8% for a 50% chance of forming government.  Here I am assuming that if there is a 73-73-5 seat distribution in the new parliament, there will be a 50-50 chance of each major party governing.

Smoothing

A smoothed version of the aggregate is used as a basis for graphs.  The smoothed version for a given week is calculated by the following formula:

(current reading*2 + previous reading + next reading)/4

or where the next reading has not happened yet:

(current reading*2.5 + previous reading*1.5)/4

Conditions for operation:

The aggregate is suspended (frozen at its existing value) at any time at which the number of individual polls included in it falls below three.  However, a suspension does not apply if it is caused by a weekly reset and it is expected that a poll will be released based on its regular schedule in the coming week.  (In this case the aggregate remains at its previous level and the weekly reset is applied only when the new poll arrives.)  When the number falls below five a cautionary note is added.

Mandatory reset events (all existing polls discarded and the aggregate restarts after three new polls) are (i) a federal election for the full House of Representatives is held (ii) there is a change of Prime Minister.  Resets may be applied in other cases but this will only occur, if at all, with great reluctance!

Other Aggregators:

Several aggregators operated during the 2016 election cycle, some of which have since been discontinued.  Links to good-quality aggregators, will be added here as they emerge from hiding.  Dodgy ones may be linked as well (if so with a critical comment!) if they are high-profile.

Bludger Track
Mark the Ballot

Log of adjustment and methods changes:

All changes made to the pollster-specific and global house effects, and any other changes made, will be listed here.

27 Nov 2016: Global house effect correction reduced from +0.4 to +0.2 for Coalition following re-emergence of Ipsos.

15 May 2017: Global house effect correction removed.  Note on respondent preferences expanded.

26 July 2017: YouGov-Fifty Acres added, with initial weighting of 0.6. (NB 28 July At first I was applying a one-point house effect correction for it in Labor's favour but this was found to be an error at this stage and has been removed.)

25 Sep 2017: Global house effect correction changed to +0.1 to Coalition.

28 Mar 2018: Provisional decision not to include return of Morgan Face-to-Face at least until third poll available.

15 Apr 2018: Multiple changes:

* Galaxy/Newspoll 2PPs are now calculated directly from the primaries because of strong evidence of a shift in calculation methods away from pure last-election preferences since December 2017.  Values for most recent two Newspolls have been altered in the current aggregate.
*Global house effect correction changed to +0.2 to Coalition, primarily on account of apparent demise of Fifty Acres series.
*Morgan Face-to-Face still not included as third poll was not independent of second.
* Explanation of One Nation adjusted figure added above and One Nation adjustment method defined and recalculated.

22 Apr 2018:  I came up with a better idea for Galaxy/Newspoll, as follows:

For Newspoll specifically, the following applies for Newspolls from December 2017 onwards.  The average difference between the last-election 2PP calculated from the primaries and the published 2PP for all Newspolls in that period (as of late April 2018 this is 0.8 points) is added to the Coalition 2PP.  Then the average above is found (subject to the above constraints) and then the average difference is subtracted.  Thus for the late April Newspoll (published 2PP 49 for Coalition, last-election 2PP 47.6 for Coalition, I add 0.8 to the last-election 2PP to get 48.4, average that with 49 to get 48.7, and deduct 0.8 to get 47.9, which is the aggregated value.)

28 Aug 2018: Quality weighting changes as follows:

* YouGov Galaxy/Newspoll: Decreased from 1.35 to 1.30 because of difficulty converting to last-election preferences caused by new preferencing method.

* ReachTEL: Decreased from 1.20 to 1.10 because of increased difficulty in obtaining full primaries including full breakdown of so-called "undecided"

* Essential: Increased from 0.75 to 0.85 because of more natural tracking performance

* Ipsos: Decreased from 0.90 to 0.80 for persistently having primary votes of Greens and Others too high.

* Morgan: Face-to-face re-included, but decreased from 0.40 to 0.30 because of post-hoc release tendencies and having primary votes of Greens too high and One Nation too low.

1 Sep 2018: One Nation adjusted preferences changed from 58.2 to 60.1 following Longman by-election.

13 Feb 2019: Morgan face-to-face house effect removed as average skew has dropped to 0.7 points to Coalition from a small sample and series had a past history of skewing to Labor.

1 April 2019: Requirement to suspend aggregate when there are not three fresh polls but a poll is expected within days have been changed.  This was resulting in brief suspensions of the aggregate.  Instead the aggregate will remain unsuspended but the weekly reset will be deferred until the new poll arrives.

8 April 2019: Newspoll/Galaxy correction method altered so that the difference between the Newspoll/Galaxy published 2PP and the 2PP implied by the primaries is now taken off the average of the last ten Newspolls/Galaxies.  Previously it was the average of all differences back to December 2017.  The reason for the change is the fall in the One Nation vote.

28 April 2019: Note added about UAP preferences.

29 April 2019: New model added for Newspoll/Galaxy to cater for way in which UAP preferences are being allocated by the pollster.

No comments:

Post a Comment

The comment system is unreliable. If you cannot submit comments you can email me a comment (via email link in profile) - email must be entitled: Comment for publication, followed by the name of the article you wish to comment on. Comments are accepted in full or not at all. If you submit a comment which is not accepted within a few days you can also email me and I will check if it has been received.