Category: Data Analysis

The sex gap

By Razib Khan | April 22, 2013 4:14 am

I was recently reading Sexual Behavior in the United States: Results from a National Probability Sample of Men and Women Ages 14–94. At N ~ 6,000 it’s a large sample of American sexual behavior around 2010. There was one descriptive result which I thought was interesting, though not surprising. Before the age of 25 it seems that women are more likely to have sex in a given year than an equivalent age man. After the age of 25 this starts to reverse, and men are more likely to be having sexual intercourse in a given year. The dynamics underlying this phenomenon seem to be easily subject to various speculations, so I’ll leave that to readers. Rather, I offer the graph (data drawn from the paper linked above):

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Data, Sex

Lots and lots of genotype data!

By Razib Khan | January 9, 2013 12:22 pm

The Estonian Biocentre has put up lots of data in Plink pedigree format. Format is important because now many of you can manipulate these data. This the source of some of the data in PHYLO, so I need to double check the IDs or rebuild the data set (though for basic exploration I think PHYLO is still sufficient). This data set, combined with my ADMIXTURE tutorial should give everyone the tools to replicate results of the sort of papers which were cutting edge from 2006-2010 (i.e., model-based clustering with bar plots + PCA/MDS visualization).

Props to Mait Metspalu and company for release these data.

CATEGORIZED UNDER: Data Analysis

Religion determines politics for Asian Americans

By Razib Khan | November 11, 2012 2:38 pm

I was at ASHG this week, so I’ve followed reactions to the election passively. But one thing I’ve seen is repeated commentary on the fact that Asian Americans have swung toward the Democrats over the past generation. The thing that pisses me off is that there is a very obvious low-hanging fruit sort of explanation out there, and I’m frankly sick and tired of reading people ramble on without any awareness of this reality. We spent the past few months talking about the power of polls, and quant data vs. qual (bullshit) analysis, with some of my readers going into full on let’s-see-if-Razib-is-moron-enough-to-swallow-this-crap mode.

In short, it’s religion. Barry Kosmin has documented that between 1990 and 2010 Asian Americans have become far less Christian, on average. Meanwhile, the Republican party has become far more Christian in terms of its identity. Do you really require more than two sentences to infer from this what the outcome will be in terms of how Asian Americans will vote?

Below I took the data from Pew’s Religious Identification Survey in terms of how all Americans lean politically based on religion, and compared it to how Asian Americans lean based on religion.

Read More

CATEGORIZED UNDER: Data Analysis, Demographics

The general social survey: information is not free

By Razib Khan | October 19, 2012 12:57 am

Last week the GSS was down. I was very sad. The SDA team explains the situation:

Part of the popularity of our demonstration archive is that it is free for end users. We are happy to provide this service. It is a valuable resource for the academic community and it also publicizes the value of our SDA software. However, the flip side of providing this free service is that it does not generate any income to offset the cost of providing the infrastructure required. We receive no funding from GSS for hosting their datasets — which is often a surprise to our users. Almost all of our income comes from the fees provided by licensing the SDA software to other data archives (like ICPSR and IPUMS), and virtually all of that income goes to support the programming and technical support that we provide them. We obviously need some additional sources of revenue.

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: GSS

The relative decline of New York

By Razib Khan | October 7, 2012 10:38 pm

Despite the real estate bubble bursting, it looks as if Florida will surpass New York in population by the next Census. I once made some quick money by betting an older gentleman that Texas had a larger population than New York. I suspect there’s even more money to be made by betting people that Florida has a larger population than New York in a few years. The reality is that most people don’t check statistics in their free time, so some “facts” get frozen in their minds. A great number of adults alive today were told in elementary or secondary school that New York was the second largest state in population. They are unlikely to update their views as they age. Unfortunately, I suspect these confusions are going to lead to public policy problems as well. I am not confident that our elected officials are any more aware of statistics than their constituents.

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Florida, New York

Who cares about overpopulation? Smart people and atheists

By Razib Khan | October 2, 2012 7:53 pm

If you had the sense that Paul Ehrlich and Garrett Hardin are very much figures of the 1970s nexus of environmentalism and population control, it seems you are right. According to Google Ngrams mention of these topics has been declining since peaking during the oil crisis, in the afterglow of the influence of the late 1960s counter-culture. The general social survey has a variable, POPGRWTH, which asks:

And please circle one number for each of these statements to show how much you agree or disagree with it. The earth cannot continue to support population growth at its present rate.

The question was asked in the year 2000 and 2010. Demographic breakdowns below for the pooled responses….

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Data Analysis

Intelligence challenged people and free speech

By Razib Khan | September 15, 2012 6:23 pm

 

In the post below I took the time out to link to the GSS, as well as posting my exact queries. As payment for this consideration the first comment was absolute drivel. I understand people have political opinions, but I’m not too interested in your opinions. You may be interested in your opinions, but I’d rather have more data. Most people don’t know enough for me to have interest in their opinions (most != all, many readers do have opinions in their specialties which I seek out).

I was trying to make a point that anger and even violence in reaction to actions which offend are actually comprehensible as the modal human response. The community reacts to punish those who violate taboos. The taboos may differ, but the response to the action of violation is normal and natural. A primary issue that needs to be considered is that taboos differ from society to society, so one is often not conscious of the act of violation (e.g., if you show the bottom of your shoes to people when you sit down, that’s an offensive act in some societies).

An implication here is that American norms of free speech near absolutism, enforced through the fiat of the courts because of their interpretation of the applicability of the Bill of Rights, are radically non-intuitive to most people. The only reason that they are intuitive to many Americans is that we are acculturated over time. This is clear when you look at differences of intelligence and education. In short, less intelligent and educated Americans are much more skeptical of allowing social deviants to speak. This is true even in cases where they are more likely to agree with the deviant in question (e.g., these groups have a more pro-military bent, and yet are more accepting of the concept of censure of pro-military opinions).

I have limited all the results to the year 2000 and later. Additionally, I classify those who score 0-4 on the WORDSUM vocab test as stupid, those who score 5-8 as average, and those who score 9-10 as smart. WORDSUM has been reported to have a 0.7 correlation with general intelligence. In this data set 20% of individuals scored 0-4, 69% 5-8, and 12% 9-10.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: GSS, Speech

The educated and conservative think fatness is a choice

By Razib Khan | September 1, 2012 9:17 pm

After the post on fatness and homophobia I decided to query the GSS on the extent to which people think that fatness has a strong biological element, similar to homosexuality. There’s a variable, GENENVO1. It asks:

Character, personality, and many types of behavior are influenced both by the genes people inherit from their parents and by what they learn and experience as they grow up. For each of the following descriptions, we would like you to indicate what percent of the person’s behavior you believe is influenced by the genes they inherit, and what percent is influenced by their learning and experience and other aspects of their environment. The boxes on handcard D1 are arranged so that the first box on the LEFT (which is numbered 1) represents 100% genetic influence (and 0% environment). The next box (numbered 2) represents 95% genes (and 5% environment), and so on. The RIGHTMOST box (numbered 21) represents 100% environmental influence (and no genetic influence). After each description, please type the number of the box that comes closest to your answer. Please use the numbered scale on handcard D1 to indicate, FOR EACH OF THE BEHAVIORS DESCRIBED, what percent of the person’s behavior you think is influenced by the genes they inherit, and what percent is influenced by their learning and experience. After each question, type the number of the box that comes closest to your answer. Remember, the higher the number, the more you think the behavior is influenced by learning and experience; the lower the number, the more you think it is influenced by genes. Carol is a substantially overweight White woman. She has lost weight in the past but always gains it back again.

Yes, the question itself is somewhat scientifically incoherent. Heritability doesn’t really work this way, but in the colloquial sense it is not an unreasonable question to ask, as it gauges real sentiment. Because the response are in five point increments, I combined the intervals 0 to 25% and 75 to 100%, and left the middle as a separate category. I crossed that with a host of demographics, and also re-ran the analysis for non-Hispanic whites only.

Before I report the results I’ll stipulate a few things (this might preempt me having to ban people who sincerely leave long, but unpleasant, comments). I accept that weight is substantially heritable, but I do not believe that the levels of obesity that we see in the United State are inevitable. But, I do also believe that there is a “moral panic” of sorts about obesity in the United States. Much of the attack on obesity which is grounded in real concerns about health also does rely upon the genuine loathing and disgust toward fat people which is widespread in American society. Additionally, there is a class dimension here, insofar as in the United States being grossly obese is more emblematic of the lower orders. All that being said, I think it is important to acknowledge that the vast majority of obese people would be happier, and live more fulfilled lives, if they weren’t obese. Though this doesn’t entail that I agree with criminalizing obesity, it does mean that I think that the “fat acceptance movement” is misguided. Rather than acceptance of fat, people need to be more generally civilized toward a level of inter-personal kindness which would diminish a whole host of cruelties. We don’t need to “liberate” fat people. We just need to “not be dicks.”

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Obesity

The future of the three "Pakistans"

By Razib Khan | August 29, 2012 10:55 pm

Over at Econlog Bryan Caplan bets that India’s fertility will be sup-replacement within 20 years. My first inclination was to think that this was a totally easy call for Caplan to make. After all, much of southern India, and the northwest, is already sup-replacement. And then I realized that heterogeneity is a major issue. This is a big problem I see with political and social analysis. Large nations are social aggregations that are not always comparable to smaller nations (e.g., “Sweden has such incredible social metrics compared to the United States”; the appropriate analogy is the European Union as a whole).

Read More

CATEGORIZED UNDER: Data Analysis, Demographics
MORE ABOUT: India, Pakistan, Population

Non-whites consistent on "life" issues

By Razib Khan | August 27, 2012 12:07 am

Over at Darwin Catholic a commenter asked whether a pro-choice commenter on this weblog also supported the death penalty. I presume that they were here pointing to the consistent life ethic issue. Many liberals who oppose capital punishment support abortion rights, and many conservatives who support capital punishment oppose abortion rights. These camps both have their viewpoints, which I’m not interested in re-litigating in the comments. But I was curious as to the overall societal support for the combinations of positions.

So I looked at the GSS, using the CAPPUN and ABANY variables (capital punishment, and abortion for any reason). In this post I will show you screenshots of the GSS output. It’s ugly, but it shows you deviation away from the expected proportions. Basically, if two variables are independent you can predict what you’d expect to be the crossed percentages over the four cells. If the results deviate from that you can ascertain particular associations. In the GSS output red means that the cell has a higher value than it should, and blue a lower value. Additionally, the intensity signals the magnitude of the deviation. I limited all results to the year 2000 and later.

First, the general aggregate result:

Read More

CATEGORIZED UNDER: Data Analysis, Ethics
MORE ABOUT: Death Penalty

Who rejects right to abortion in cases of rape?

By Razib Khan | August 21, 2012 11:56 pm

It’s basically impossible to avoid hearing about Todd Akin right now. My Twitter and Facebook feeds are kind of swamped. But it did make me wonder: what percentage of Americans reject abortion in cases of rape and incest? The GSS has a handy variable, ABRAPE, which asks respondents about the possibility of abortion if a woman gets pregnant as a result of rape (let’s stipulate that it’s possible to get pregnant as a result of rape!). I also limited the sample to the year 2000 and later, and non-Hispanic whites (to clear out confounds). Demographic breakdowns below….

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Abortion, GSS, Todd Akin

More GSS, less speculation!

By Razib Khan | August 20, 2012 11:32 pm

Long time readers know that one of my pet hobby-horses is to try and convince more pundits that they should use the GSS. Opinions based on opinions may be fun, but opinions based on facts may be useful. In general my appeals have fallen on deaf ears. But today I notice that Will Saletan is using GSS data to discussion the Todd Akin case. You may not agree with Saletan’s take on the results, but at least he bothered to generate some results.

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: GSS

More racist: white liberals or white conservatives?

By Razib Khan | August 19, 2012 11:02 pm

Reihan Salam has a post up on the alignment of racism and political orientation. He begins:

Recently, Chris Hayes, host of MSNBC’s UP with Chris Hayes, made the following observation:

It is undeniably the case that racist Americans are almost entirely in one political coalition and not the other.

Chris is a good friend of mine, and we grew up in the same milieu. I can attest to the fact that the view he expressed is very widely held in the circles in which we both travel….

Salam then links to Alex Tabarrok, who uses party identification data to indicate that actually racism is split between the two groups, while John Sides suggests that there is a definite lean toward Republicans being more racist, using a few indicator variables. Overall I think Sides is about right, all things equal conservatives are more racist than liberals. At least in the modern context of the two ideologies.* I say conservative/liberal rather than Republican/Democrat, because my experience with the GSS data set is that ideology is a more powerful predictor of social views among whites. This holds true with the variables which Tabarrok and Sides query from what I can see; the gap between Democrats and Republicans is smaller than between liberals and conservatives. Why? There are still a non-trivial number of self-identified conservative Democrats in this country, as well as very well off socially moderate Republicans who vote their economic interests.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: GSS, Interracial, Racism

Who shall inherit the earth?

By Razib Khan | August 15, 2012 10:44 pm

There was a question below in regards to the high fertility of some extreme (“ultra”) religious groups, in particular Haredi Jews. The commenter correctly points out that these Jews utilize the Western welfare system to support large families. This is not limited to just Haredi Jews. The reason Somalis and Arabs have fertility ~3.5 in Helsinki, as opposed to ~1.5 as is the norm, is in part to due to the combination of pro-natalist subcultural norms, and a generous benefits state. Of course we mustn’t overemphasize economics. Israel’s decline in Arab Muslim fertility but rise in Jewish fertility in the 2000s has been hypothesized to be due to different responses to reductions in child subsidies by Muslims and the Haredi Jews. In short, the former reacted much more strongly to economic disincentives in relation to the latter.

A bigger question is whether exponential growth driven by ideology can continue indefinitely. I doubt it. Demographics is inevitable, but subject to a lot of qualifications. Haredi political power in Israel grants some benefits, but at the end of the day basic economics will serve as a check on the growth of the population of this sector. Similarly, barring massive productivity gains we’ll see some structural changes to the provision of government services across the aging developed world.

Below are some fertility numbers from the GSS. You see the median number of children for non-Hispanic whites born before 1960 from the year 2000 and later. I’ve compared the demographics of fundamentalists, non-fundamentalists, and those who are skeptical of the revealed nature of the Bible.

Read More

CATEGORIZED UNDER: Data Analysis, Demographics
MORE ABOUT: Demographics, Fertility

What is the distribution of offspring per individual?

By Razib Khan | August 9, 2012 8:58 am

A commenter below notes:

Also, in modern society, doesn’t just about everyone reproduce, such that not only is any particular advantage competing against other countervailing pressures as you note, but also that the “less fit” genomes are not removed from the overall population, but rather are added back to the mix? In other words, the less-preferred short males don’t die and have zero kids, they also get married and their genes get thrown back into the pot.

First, let’s not get caught in the assumption that for genes to be disfavored one has to have zero fitness in individuals carrying those genes. If, for example, in a situation of demographic expansion you had individuals who had eight children vs. those who had one child, there would be selection for the traits which were passed by those with eight children in relation to those who had one child. But, it did make me realize I wasn’t intuitively aware of the distribution of number of offspring in the population. I assumed that the median was around two, but that’s about it.

So, I looked at the GSS CHILDS variable for individuals born in 1950 or earlier from the year 2000 on (COHORT and YEAR variables). I also separated out the results by sex. Do not take these results as definitive because the GSS data set is not entirely representative. But, it does give you a general sense.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Data

College makes you believe in marriage!

By Razib Khan | July 29, 2012 11:05 pm

There’s a cliche, which isn’t totally false, that more education tends to lead one toward heterodox viewpoints which challenge conventional norms. But one issue that has been coming to the fore over the last 10 years or so is that college educated Americans tend toward social liberalism, and yet often continue to live very bourgeois lives. In other words, the freedoms which they favor are those freedoms which are ever operative in their own lives. In contrast those Americans without college educations tend to have a less libertarian attitude toward personal mores, but have lives characterized by greater disturbance and disastrous choices.

And yet this does not hold in the case of what articles such as this report, How Divorce Lost Its Groove:

Though she wasn’t entirely surprised. Ever since her divorce three years ago, Ms. Thomas said, she has been antisocial, “nervous about what people would say.”

After all, she had gone from Park Slope matron, complete with involved husband (“We had cracked the code of Gen X peer parenthood”) and gut-renovated brownstone, to “a Red Hook divorcée,” she said, remarried with a new baby and two children-of-divorce barely out of preschool. “All of a sudden, this community I’d lived in for 13 years became this spare and mean savannah,” she said.

It was as if, she said, everyone she knew felt bad for her but no one wanted to be near her, either. Even though adultery was not part of the equation, Ms. Thomas said, “I feel like I have a giant letter A on my front and back.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Divorce, marriage

People wanted more children in 2000s, but had fewer

By Razib Khan | July 16, 2012 7:19 pm

The readers of this weblog are relatively non-fecund, at least going by reader surveys. But I was curious nonetheless about the attitudes toward number of children, and realized goals of number of children, in the General Social Survey. I decided to look at two variables:

CHILDS

CHLDIDEL

The former asks the respondent how many children they had, the latter how many they’d like to have. I restricted the sample to whites ages 45-65 for every survey year. I then combined all the years of a particular decade, so you have 1970s, 1980s, 1990s, and 2000s. For demographics I looked at highest educational attainment, and household income indexed to 1986 real value dollars (so they are comparable across decades).

Two major takeaways:

1) Education matters more than income in terms of number of children. Having lots of education tends to reduce family size. No great surprise.

2) Ideal number of children increased in the 2000s, but the decline in average number of children continued.

There is often talk in the literature on the disjunction between ideal family size in Third World nations and the realized family size, with a larger number of children than women may want. What is less discussed is the inverse discussion. It seems that Americans want larger families than they manage to have. Of course, there is the distinction between avowed and realized preferences here.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Demographics

Who objects to painful tests on animals?

By Razib Khan | July 13, 2012 7:39 pm

In the year 2008 1,400 people were asked this question in the GSS:

Scientists should be allowed to do research that causes pain and injury to animals like dogs and chimpanzees if it produces new information about human health problems. (Do you strongly agree, agree, disagree, or strongly disagree?)

(variable ANSCITST)

I was curious because I ran into some stuff on pro-life sites today about how animal rights activist don’t oppose abortion, and how hypocritical that is. So naturally I was curious about how attitudes varied on that issue.

Allow animal testing which might case pain
Strongly agree Agree Disagree Strongly Disagree
Yes to abortion on demand 8 31 39 22
No to abortion on demand 7 32 39 23

What the results above show is that there is almost no difference in attitudes toward animal research when you vary attitudes toward abortion on demand. In other words, 22 percent of pro-choice people oppose such research strongly, while 23 percent of pro-life people do. How does this vary by demographic?

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Animal Rights

Higher vocabulary ~ higher income

By Razib Khan | June 24, 2012 8:54 pm

Prompted by a comment below I was curious as to the correlation between intelligence and income. To indicate intelligence I used the GSS’s WORDSUM variable, which has a ~0.70 correlation with IQ. For income, I used REALINC, which is indexed to 1986 values (so it is inflation adjusted) and aggregates the household income. Finally, I limited my sample to non-Hispanic whites over the age of 30 (for what it’s worth, this choice also limited the data set to respondents from the year 2000 and later).

The results don’t get at the commenter’s assertions, because 10 out of 10 on WORDSUM does not imply that you’re that smart really. But the trendline is suggestive. Note that aggregated 0-4 because the sample size at the lower values is small indeed.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Data, GSS, Income, IQ

Attitudes toward genetically modified crops & science

By Razib Khan | June 23, 2012 12:20 pm

In the further interests of putting quantitative data out their instead of vague impressions, I noticed two GSS variables which might be of interest. One queries the impression of effect on the environment of genetically modified crops. The second asks about whether science does more harm than good. The latter question exhibited almost no year to year variation of note, so I just threw them in a pot together. But for the environment and genetically modified crop question I show responses for the year 2000 and 2010. As you can see there is a modest difference in regards to the first where liberals are more skeptical.

Read More

CATEGORIZED UNDER: Data Analysis
MORE ABOUT: Data
NEW ON DISCOVER
OPEN
CITIZEN SCIENCE
ADVERTISEMENT

Discover's Newsletter

Sign up to get the latest science news delivered weekly right to your inbox!

Gene Expression

This blog is about evolution, genetics, genomics and their interstices. Please beware that comments are aggressively moderated. Uncivil or churlish comments will likely get you banned immediately, so make any contribution count!
ADVERTISEMENT

See More

ADVERTISEMENT

RSS Razib’s Pinboard

Edifying books

Collapse bottom bar
+

Login to your Account

X
E-mail address:
Password:
Remember me
Forgot your password?
No problem. Click here to have it e-mailed to you.

Not Registered Yet?

Register now for FREE. Registration only takes a few minutes to complete. Register now »