Words with Reps: Map of Vocab on the House Floor
What am I looking at?
This map shows the breadth of vocabulary of each of the nation’s 435 voting Representatives. A darker green color indicates that a Rep has a larger vocabulary. There are 6 non-voting Representatives excluded from the map, though they are included in the rankings. The map is colored by each Rep’s “SQPD Ranking” (more details below), which measures a Representative’s usage of 3,393 different words that might be found on the SAT.
What are the numbers that appear when I click a district?
- Word Diversity Rank reflects how frequently a rep uses SAT words vs. usage of a basket of common words. The common words include: ‘are’, ‘one’, ‘they’, ‘was’, ‘were’, and ‘with’. The basket was specifically designed to be non-partisan. Oddly enough, some common words—even ‘one’, for example—show what appears to be a slight partisan tilt. [Teaser: subject of a future post.] The highest word diversity rank belongs to none other than Ronald Earnest Paul (aka Ron Paul).
- SAT Words Used (Count) reflects the number of different SAT words used by a Representative. The highest total (1,178) belongs to Sheila Jackson-Lee (D-TX). Not only is she highly educated, but she’s also been in office since 1995.
- SQPD Index, or the “sesquipedalian index”, is an aggregate measure of a Representative’s vocabulary. It includes both the SAT Words Used (Count), and Word Diversity Rank. Slightly higher weight is given to the count. Both factors are important, though, since certain Reps have a much, much longer record than others.
- SAT Words Used (Total) is the total number of SAT word utterances. In other words, if 10 SAT words were said 20 times, a Rep would have a score of 200. The highest total (5,027) also belongs to Sheila Jackson-Lee.
Who have the 10 highest scores?
In order, the Representatives with the 10 highest SQPD Index values are as follows:
- Sheila Jackson Lee, (D-TX): 4.79
- Christopher H. Smith, (R-NJ): 2.98
- Dennis Kucinich, (D-OH): 1.00
- Steve King, (R-IA): 0.99
- Barney Frank, (D-MA): 0.97
- Charles Rangel, (D-NY): 0.95
- John Conyers, (D-MI): 0.92
- Ronald Ernest Paul, (R-TX): 0.87
- Steny Hoyer, (D-MD): 0.84
- Marcy Kaptur, (D-OH): 0.84
What data sources were used to create this map?
The Congressional Record is the “official record of the proceedings and debates of the United States Congress.” The Sunlight Foundation, through its Capital Words API, processes the Congressional Record every day. They currently maintain data from 1996 onward. You can query word or phrase frequency by political party, over time, by representative, and so on.
The list of SAT words was obtained from two sites, freevocabulary.com and majortests.com. There were 5,372 words on the original list, although some of them hardly seem like SAT words (‘further’, ‘believe’, ‘off’), while others are mainstays of Congressional dialogue (‘foreign’, ‘unanimous’, ‘bureaucracy’). So, I excluded all words said more than 500 times from the final list, leaving 3,393 SAT words. The most common of these are ‘barring’, ‘culprit’, ‘passive’, and ‘revert’.
Finally, the map itself was created using Google Fusion Tables. Here’s a link to the table. Enjoy!
- myhandmadejewelry likes this
- angela-hunt reblogged this from dfkoz
- laphotos reblogged this from sunfoundation
- photosnew reblogged this from sunfoundation
- hilariouslywizardblonde reblogged this from dfkoz
- lovebeautifulthings likes this
- lovebeautifulthings reblogged this from dfkoz
- e-tag reblogged this from npr
- withoutayard reblogged this from dfkoz
- tomzshow likes this
- northonapa likes this
- yeahwellyourface reblogged this from dfkoz
- stakooza reblogged this from dfkoz
- udderfly reblogged this from dfkoz
- rosamour reblogged this from npr
- shabrittnaynay reblogged this from dfkoz and added:
- myrthablackswan reblogged this from npr
- dugbaker likes this
- dugbaker reblogged this from npr
- ericgeller likes this
- ay-caramba13 likes this
- caramelkite likes this
- danibeef reblogged this from dfkoz and added:
- monicaonline likes this
- corinneavital reblogged this from onthemedia
- sjwhipp reblogged this from dfkoz
- snowgray reblogged this from dfkoz
- onthemedia reblogged this from dfkoz
- milwaukeestat reblogged this from dfkoz
- bryckk reblogged this from dfkoz
- bryckk likes this
- moradiogirl reblogged this from dfkoz
- damnsmartblueboxes likes this
- wrench-wench reblogged this from dfkoz and added:
- thelesbianwhowouldbeking reblogged this from dfkoz
- oldparasitesingle likes this
- borisbeyeltsin reblogged this from npr
- iycrmm reblogged this from npr
- iycrmm likes this
- thebattleofwits likes this
- singingalong reblogged this from npr
- lajefita likes this
- johntedesco reblogged this from lifeandcode
- johntedesco likes this
- beyondareasonablestout reblogged this from npr
- yangguangyuan reblogged this from npr
- urmonotheismus likes this
- principia-coh reblogged this from ginamak
- numenia likes this
- chimericalideals likes this