New Philosophy of Social Science: November 2013

Wednesday, November 13, 2013

Cruickshank on philosophical issues with critical realism

Justin Cruickshank is an interesting commentator on the philosophical underpinnings of critical realism. Critical realism was developed initially by Roy Bhaskar in A Realist Theory of Science and The Possibility of Naturalism: A philosophical critique of the contemporary human sciences, and has been further elaborated by a number of philosophers. The theory is now playing a lively role within sociology and sociological theory. Cruickshank’s key ideas are developed in several papers, “A tale of two ontologies: an immanent critique of critical realism” (2004) (link), “Knowing Social Reality: A Critique of Bhaskar and Archer’s Attempt to Derive a Social Ontology from Lay Knowledge” (2010) (link), and “The positive and the negative: Assessing critical realism and social constructionism as post-positivist approaches to empirical research in the social sciences" (2011) (link). Fundamentally Cruickshank takes issue with the nature of the arguments that critical realists have offered for their specific ideas about ontology.

Cruickshank regards the doctrines of critical realism as expressed by Bhaskar and his successors as fundamentally a philosophical theory rather than a highly general and abstract social theory; and he finds that the theory is justified on several lines of philosophical argumentation. The arguments that he criticizes involve apriori philosophical reasoning and inference from lay concepts about the natural and social worlds.

"A tale of two ontologies" highlights the philosophical presuppositions and language of critical realism — assumptions about the variants of ontology (transitive and intransitive), absolute metaphysical knowledge, transcendental metaphysical knowledge, conceptual science, immanent critique. Cruickshank finds that Bhaskar embraces the idea that critical realism is a philosophical theory rather than a scientific theory, and that this places the theory on shaky ground:

In support of the differentiation of philosophy from science, and contrary to the claim made about the historical transitivity of ontology made in response to Chalmers, Bhaskar says he avoids the epistemic fallacy by producing a philosophical ontology. He argues that if we conflate scientific and philosophical ontologies then we commit the epistemic fallacy, by remaining confined within questions about knowledge. (573)

The transcendental method that Bhaskar uses, according to Cruickshank, is based on Kant’s philosophical theories:

Against empiricism, Bhaskar’s transcendental realism (which was later renamed ‘critical realism’) holds that the condition of possibility of science is the explanation of causal laws which are different from the changing contingent observable regularities we may perceive outside experiments. The ontological turn advocated in RTS is meant to render explicit the ontological presuppositions implicit within the practice of science. In doing this, Bhaskar argues that the condition of possibility of science is the existence of underlying causal laws in open systems (i.e. systems characterised by change with no observable constant conjunctions), rather than causal laws being observed constant conjunctions within artificial closed laboratory systems. (569-570)

But this method leads to a conundrum:

The version of ontology required to allow critical realism to fulfill its hegemonic project rests on a dogmatic metaphysical claim to know a stratum of ultimate reality beyond knowledge. Critical realists try to avoid such explicit dogmatism by defining ontology in terms of the transitive domain rather than the intransitive domain. However, defining ontology in terms of the transitive domain commits the epistemic fallacy, and precludes any possibility of the ontology being used as the basis for an hegemonic project, as the ontology would be fallible and hence open to revision (unless dogmatically privileged). [my italics] (581)

So Bhaskar et al have painted themselves into a metaphysical corner: they require that ontology should be about reality as it really is (intransitive); they retreat from the implication of a dogmatic philosophical position; and they wind up in the position of conceptual relativism (transitive domain) that they sought to avoid.

Cruickshank plainly prefers to deal with these issues in a way that is not so dependent on purely philosophical arguments. Here is the position that Cruickshank thinks is most reasonable:

We may accept the view that ontological questions are important questions, and argue that we ought to regard ontological theories as fallible interpretations of reality. In other words, the focus in this article is on the status claimed for ontology, and not the issue of wether one or other substantive social ontology is the definitively correct or incorrect definition of social reality. The emphasis is on continually developing ontological theories through critical dialogue, rather than arguing that an individualist, or structuralist, or praxis based ontology, etc., is the correct definition of social reality. (568-569)

...

In contrast to foundational epistemology which defines reality to fit a subjective, mentalistic foundation, we may adopt an anti-foundational approach that rejects the starting point of epistemology as the separation of the lone mind from the world. We may instead hold that our beliefs are engaged with the world and that we need to revise and replace our theories in the course of our engagement in the world. (582)

...

As regards social ontology this means that social scientists need to become engaged in an on-going debate about the ontological theories currently existing in the transitive domain. This debate needs to turn not just on the use of immanent critique, to assess the internal coherence of a position, but also on the usefulness of an ontology in informing empirical work. (583)

And in fact, this seems like an entirely defensible way of thinking about the role of ontology: not as a set of philosophical truths to be established by a priori arguments, but rather as a revisable set of ideas coherently related to the best scientific conceptual systems we have developed to date.

Wednesday, November 6, 2013

Structural realism and social realities?

The topic of realism has come up frequently here -- causal realism, critical realism, scientific realism. Each of these realisms comes out of somewhat different fields of questions and assumptions. Within mainstream philosophy of science there is another realism that has been debated in the past twenty years, referred to as structural realism. The view has been developed by philosopher John Worrall, and his 1989 article "Structural Realism: The Best of Both Worlds?", sets the stage (link). So what is this view, and does it have any relevance to the social sciences?

First, what is the view? It is a refinement to the theory of scientific realism advocated by philosophers like Hilary Putnam and Dick Boyd -- the view that we have reason to believe that the world has approximately the features attributed to it by the best available scientific theories. As Boyd put the view quite a few years ago, what else could explain the success of those theories if not their approximate truth and successful reference to the entities and properties of the world?

The problem that gives rise to structural realism is what Worrall calls the "pessimistic meta-induction" (109): in the history of science, most scientific theories have eventually been proven to be false. So how can scientific realists claim, after all, that there is a rational basis for believing that the world has the characteristics asserted by the current generation of scientific theories? The answer to this question, Worrall argues, comes down to a judgment call about the history of science: "just how radical theory-change has standardly been in science" (105). If successor theories have nothing in common with their antecedents except a broader but overlapping range of empirical consequences, then it is hard to say that there is an approximate truth that is captured by both stages of the theory. "If, on the contrary, the realist is forced to concede that there has been radical change at the theoretical level in the history of even the mature sciences then he surely is in deep trouble" (107). Realism, then, depends on some degree of approximate continuity across successor theories. Here Worrall turns to Richard Boyd:

"The historical progress of the mature sciences is largely a matter of successively more accurate approximations to the truth about both observable and unobservable phenomena. Later theories typically build upon the (observational and theoretical) knowledge embodied in previous theories." (Boyd, 1984, "The Current Status of Scientific Realism" in Leplin, ed., Scientific Realism)

But many philosophers and historians of science have disputed the degree of continuity that Boyd postulates here. They emphasize the discontinuities that often occur across the process of theory change in physics. However, Worrall argues that there is a more abstract way in which physical theories show substantial continuity. This continuity isn't found at the level of entities and causal powers, but rather a set of more abstract characteristics that are attributed to the features of the world under study.

Structural realism gets going, then, if we concede that the history of physics shows radical change at the level of the properties attributed to natural objects but we maintain that it also shows a strong degree of continuity when it comes to the basic structural properties that are postulated by theories of physics.

In application to the series of theories offered to explain the behavior of light, the continuity was abstract:

There was continuity or accumulation in the shift [from Fresnel to Maxwell], but the continuity is one of form or structure, not of content. (117)

Worrall attributes this idea about a specific but abstract kind of continuity in physics to Henri Poincare, and he argues that it lays the basis for a weaker form of realism that might be described as syntactic or structural realism(117).

Roughly speaking, it seems right to say that Fresnel completely misidentified the nature of light, but nonetheless it is no miracle that his theory enjoyed the empirical predictive success that it did; it is no miracle because Fresnel's theory, as science later saw it, attributed to light the right structure.... There is no elastic solid ether. There is, however, from the later point of view, a (disembodied) electromagnetic field. The field in no clear sense approximates the ether, but disturbances in it do obey formally similar laws to those obeyed by elastic disturbances in a mechanical medium. (117-118)

So structural realism when applied to the history of the theory of light says two things: successor theories had radically different and inconsistent hypotheses about the mechanics and substance of light; but they agreed approximately about the mathematical properties of light. And it is the latter that is preserved across the progress of this area of science.

This is a very weak form of realism, as Worrall acknowledges:

[The structural realist] insists that it is a mistake to think that we can ever "understand" the nature of the basic furniture of the universe.... On the structural realist view what Newton really discovered are the relationships between phenomena expressed in the mathematical equations of his theory, the theoretical terms of which should be understood as genuine primitives. (122)

So the commonsensical questions we might want to ask of contemporary physics -- are there electrons, is space curved, is the speed of light constant -- do not have defensible answers, according to structural realism. What the success of modern physics allows us to conclude is something much weaker: whatever the fundamental components of matter, space, time, light, and gravity are, the world conforms to the mathematical transformations that are specified by our best confirmed contemporary physical theories. It is the transformations, equations, and constants that we can be realistic about, not the concrete theories of the mechanics of the things that embody these equations.

My real interest in opening this topic was to consider whether it has any relevance to the social sciences. And the short answer seems to be -- not much. Theories in the social sciences rarely have the mathematical specificity that is crucial to the structural realist argument. So it is difficult to make the argument that Ricardo, Marx, Pareto, and Keynes were describing the same structural reality when they wrote about capitalism. Their substantive assumptions are quite different; but further, the expected "mathematical" behavior of the capitalist market system is also substantially different across the theories. Perhaps a more plausible case is the transition from Marx's classic theory of exploitation, based on the labor theory of value, to John Roemer's theory of exploitation in A General Theory of Exploitation and Class, based on neoclassical and game-theoretic economic assumptions. The two theories arrive at similar "structural" features of a capitalist economy, in spite of the fact that the underlying substantive assumptions are quite different.

(Katherine Brading and Elise Crull offer a very nice treatment of Worrall's interpretation of Poincare in "Epistemic Structural Realism and Poincare's Philosophy of Science (link).)

Sunday, November 3, 2013

Does the microfoundations principle imply reductionism?

My philosophy of social science has always and consistently maintained the idea that social facts depend on the activities and beliefs of individuals. There is no social "stuff" that exists independently from individual actors. I have encapsulated that idea in the form of the "microfoundations" principle: any claim about the characteristics or causal powers of social entities must be compatible with there being microfoundations for those properties and powers at the level of the actor.

At the same time, I also believe that there is an appropriate domain for social science: the exploration of the features and powers of the social world. I don't believe that methodology should force the sociologist to become a psychologist or to shift his/her attention to the micro level.

Are these two premises compatible? Or does the microfoundations principle actually entail reductionism? Does it imply that explanations couched at the level of social vocabulary are incomplete and derivative, and that the real explanation must be found at the level of the micro-activities of individuals?

I attempt to resolve this apparent dilemma by distinguishing between strong and weak versions of the microfoundations principle: "social explanations must provide microfoundations for their assertions about social properties and powers" versus "social explanations must be compatible with there being microfoundations for their assertions about social powers and properties." The weak version reflects an appropriate stipulation based on what we know about the ontology of the social world, whereas the strong version is a kind of explanatory reductionism that is unjustified.

My position, then, is that sociology is a special science in Fodor's sense, and that sociologists both can and do treat their domain as relatively autonomous.

Several commentators allege that my commitment to microfoundations -- which is unwavering -- vitiates my ability to claim relative explanatory autonomy for the meso level. Some don't like my distinction between weak and strong microfoundations, and others think that commitment to MF means explanations have to proceed through explicit discoveries of the MF pathways.

My position is intended to exactly parallel physicalism in cognitive science: we are committed to the idea that all cognitive processes are somehow or other embodied and carried out by the central nervous system. But we are not obliged to actually perform that reduction in offering a hypothesis and explanation at the level of cognitive systems.

Even more prosaically: we believe that the properties of metals depend upon the quantum properties of subatomic particles. Does anyone seriously believe that civil engineers aren't giving real explanations of bridge failures when they refer to properties like tensile strength, compression indices, and mechanisms like metal fatigue? We can observe and measure the metal's properties without being forced to provide a quantum mechanical deduction.

One observer writes that "Little's examples actually confirm that meso-level mechanisms work only through micro-level processes." Yes, and I likewise confirm that cognitive processes work only through neural events and material properties work only through quantum physics. But I don't accept that this demonstrates that the higher level cannot be treated as having real causal properties. It does have those properties; and we simply reaffirm the point that somehow or other those properties are embodied in the lower level elements. This isn't a new idea; it was contained in Jerry Fodor's "Special Sciences" article years ago. If the argument is generally a bad one then we are forced to undo a lot of work in cognitive science. If it is generally compelling but inapplicable to social entities then we need to know why that is so in this special case of a special science.

To be clear, I too believe that there is a burden of proof that must be met in asserting a causal power or disposition for a social entity -- something like "the entity demonstrates an empirical regularity in behaving in such and such a way" or "we have good theoretical reasons for believing that X social arrangements will have Y effects." And some macro concepts are likely cast at too high a level to admit of such regularities. That is why I favor "meso" social entities as the bearers of social powers. As new institutionalists demonstrate all the time, one property regime elicits very different collective behavior from its highly similar cousin. And this gives the relevant causal stability criterion. Good examples include Robert Ellickson's new-institutionalist treatment of Shasta County and liability norms and Charles Perrow's treatment of the operating characteristics of technology organizations. In each case the microfoundations are easy to provide. What is more challenging is to show how these social causal properties interact in cases to create outcomes we want to explain.

The best reason I am aware of to doubt stable causal powers for social entities is founded on the point that organizations and institutions are too plastic to possess enduring causal properties over time. I've made this argument myself on occasion. But researchers like Kathleen Thelen in The Evolution of Institutions demonstrate that there are in fact some institutional complexes that do possess the requisite stability.

So I continue to believe both things: that statements about social entities and powers must be compatible there being microfoundations for these properties and powers; and that it is theoretically possible that some social structures have properties and powers that are relatively autonomous, in the sense that we can allude to those properties and powers in explanations without being obliged to demonstrate their microfoundations.

Causal inference and random trials

image: Tamil Nadu nutrition study

Nancy Cartwright has spent much of her career probing the assumptions scientists make about causation. She has helped to demonstrate that the Humean assumptions about causation that philosophers (including Carl Hempel) carried into twentieth century philosophy of science don't come close to answering the question correctly, and she has provided many reasons to take seriously the ideas of causal powers and mechanisms rather than governing causal regularities. How the Laws of Physics Lie is an important contribution to the philosophy of science and to realist theory.

Her current book Evidence-Based Policy: A Practical Guide to Doing It Better (with Jeremy Hardie) provides a different critical perspective on causal inference, this time in the context of social policy reasoning. The design and implementation of public policies rest upon a fundamental premise: that we can have evidence-based reasons for predicting what the effects of the policy tool will likely be. But what kind of evidence might that be? The dominant form of evidence favored in the policy science field is random controlled trials: specify the policy intervention P, choose a domain of cases to apply the intervention P to, randomly select cases to receive the intervention (versus the control group that does not), and measure the value of the outcome of interest. If there is a significant difference in the value of the outcome between test group and control group, then we have evidence that P had an effect.

In a nutshell, C&H take issue with the conviction that random controlled trials (RCT) -- the gold standard of causal inference and experiment in clinical medicine -- provide a basis for expecting that a given policy intervention will have similar effects in the future. Their book can be read as a critique of an excessively statistical understanding of social causality, without realistic analysis of the underlying mechanisms and processes. As Cartwright and Hardie state repeatedly, RCT evidence shows only that the policy worked on the circumstances tested in the study. Instead, they argue that we need to offer evidence about two additional considerations: whether the "causal principle" associated with P will remain the same in new circumstances; and whether the associated conditions necessary for the operation of this principle will be present in the new circumstances.

Here is a fundamental statement of what they mean by a causal principle:

We suppose that causes do not produce their effects by accident, at least not if you are to be able to make reliable predictions about what will happen if you intervene. Rather, if a cause produces an effect, it does so because there is a reliable, systematic connection between the two, a connection that is described in a causal principle. (22)

The statement, "El Nino causes wet winters in North America," is a causal principle. But causal principles are neither universal nor exceptionless:

The fact that causal principles can differ from locale to locale means that you cannot read off that a policy will work here from even very solid evidence that it worked somewhere else. (23)

Here is a more extensive description of this idea:

Causal principles are not universal. They differ from place to place and from time to time. That means that it is not enough for you to know that the policy worked somewhere or even that it has worked at some time here. “It worked there”; it played a positive causal role there. So it was one of the factors from a causal principle that holds there. To predict that it will work here, you need to know that it is one of the factors from a causal principle that holds here. That is what ensures that it can play a positive causal role for you. (50)

Cartwright and Hardie look at causation along the lines of J. L. Mackie's analysis of INUS conditions in The Cement of the Universe: a factor is a cause if it is an "Insufficient but Necessary part of an Unnecessary but Sufficient condition for producing a contribution to the effect" (23). The evidence of favorable CRT studies for a given policy intervention doesn't show that this policy will work in the new circumstances of the new proposed application. In order to draw this inference we need to have confidence that the treatment will play the same causal role in the new setting, and that the necessary conditions will be present in that setting. In other words, we need a more detailed causal analysis of the past and the proposed future.

Here is a sketch of the argument that C&H suggest we need to provide in order to project favorable RCT studies onto future applications:

x works there (i.e., x genuinely appears in the causal principle that governs the production of y there post-implementation).

Here and there share that causal principle post-implementation.

The support factors necessary for x to contribute under that principle are present for at least some individuals here post-implementation.

Conclusion. x works here (i.e., x genuinely appears in the causal principle that governs the production of y here post-implementation and the support factors necessary for it to contribute to y are present for at least some individuals here post-implementation). (41)

One way of offering support for premise 2 is to engage in the method of process tracing:

This method confirms the existence of a causal connection between start and finish by confirming, one-by-one, a series of smaller causal steps in between. (38)

Cartwright doesn't put her case in these terms, but I would say that the heart of her intuition is that social outcomes are different from medical outcomes because of their inherent causal heterogeneity. In the social world outcomes like teen pregnancy rates or high school dropout rates are the result of a bundle of conjunctural causal processes. So projecting the results of past random controlled trials into the future requires that we first confirm that the same causal influences and important background conditions are at work. And this is rarely the case. So the fundamental underlying prescription is a pragmatic causal realism about social processes: in order to design and implement policies, we need to have a well developed map of the real causal processes and mechanisms that are underway in the production of the effect we would like to change. In other words, we need to be causal realists if we are to be effective policy makers.

(It is worth observing that this book is deliberately different in tone and specialization from Cartwright's other monographs in the philosophy of causation. The book is designed to be useful for real practitioners of public policy, and it offers clear advice about how to gain the understandings needed in order to validate the idea that a given policy will have desired effects in a novel setting.)

Organizations and strategic action fields

image: Hierarchical modularity of nested bow-ties in metabolic networks, Jing Zhao, Hong Yu, Jian-Hua Luo, Zhi-Wei Cao and Yi-Xue Li (link)

Neil Fligstein and Doug McAdam provide a full exposition of their theory of strategic action fields in A Theory of Fields. As observed in an earlier post, this theory presents an innovative way of thinking about the composition of the social.

The basic idea is that the fundamental structure of social life is "agents behaving strategically within a field of resources and other agents." Here is a preliminary description of strategic action fields.

A strategic action field is a constructed mesolevel social order in which actors (who can be individual or collective) are attuned to and interact with one another on the basis of shared (which is not to say consensual) understandings about the purposes of the field, relationships to others in the field (including who has power and why), and the rules governing legitimate action in the field. A stable field is one in which the main actors are able to reproduce themselves and the field over a fairly long period of time. (1)

Fligstein and McAdam do not give fundamental ontological status to structures or organizations, and they do not presuppose a dichotomy between agents and structures. Instead, organizations and institutions are ensembles of agents-in-fields, at a range of levels. Here is what they have to say about firms, which can be extended to organizations more generally:

Firms are nested strategic action fields in which there are hierarchical dependent relationships between the component fields. Each plant and office is a strategic action field in its own right. Typically firms are organized into larger divisions in which management controls resource allocation and hiring. (59)

This theory possesses microfoundations; this is the thrust of the second chapter in the book. Their account is largely organized around the idea of social skill at the level of the actor. What I want to explore here, though, is the "macro-sociology" of the theory. In particular, how do our concepts of meso-level social structures like institutions and organizations get parsed when we use the language of strategic action fields? And substantively, how can we account for the relative level of stability that organizations and institutions possess, if they are simply composites of strategically motivated actors? This description suggests a high degree of fluidity, as strategies and coalitions shift. But instead, we observe a high level of stability in organizations much of the time, persisting over multiple generations of actors.

The answer seems to derive from the idea that F&M introduce of "internal governance units."

In addition to incumbents and challengers, many strategic action fields have internal governance units that are charged with overseeing compliance with field rules and, in general, facilitating the overall smooth functioning and reproduction of the system. (13)

Organizations are configured around incumbents who are assigned roles and powers that give them both an interest and an ability to maintain the workings of the organization. So stability is not a primitive quality of an organization; instead, it is a consequence of the specific interlocking assignments of interests and powers within the various networks of agents that make up the organization. Stability is a dynamic feature of the organization, reproduced by the actions of incumbents. And change in the organization occurs when there is significant alteration in those interests and powers.

Field stability is generally achieved in one of two ways: the imposition of hierarchical power by a single dominant group or the creation of some kind of political coalition based on the cooperation of a number of groups. (14)

On this approach, then, stability is a consequence of the configuration of a given system of strategic fields, rather than an axiomatic property of the organization.

There is a great deal of consonance between this theory and the ideas about organizations and actors put forward by Crozier and Friedberg some forty years ago in Actors and Systems: The Politics of Collective Action; here is an earlier post on their work. Crozier and Friedberg too looked at organizations as arenas of strategic and opportunistic action by agents. They too emphasized the role of cooperation and alliances within organizations. And they too looked at organizations as solutions to problems of collective action. There is no indication that Fligstein and McAdam were directly influenced by Crozier, and indeed the research communities including both are fairly distinct. So this looks like a case of independent discovery of a new idea rather than sequential development of this idea. It looks more like the case of Wallace and Darwin in the discovery of natural selection, than Darwin and Huxley in the development of that idea.

Levels of the social

We can examine social life at many levels of granularity -- from ordinary individual social behavior to small groups to cities and regions to the global system of communication and extraction. Is there any basis for thinking one level is better than another for the social sciences?

There are two kinds of considerations that might be used as grounds for answering this question. One is about scientific feasibility and the other is about explanatory scope.

The feasibility line goes this way. It might be that higher level social phenomena are substantially less orderly than lower level phenomena. This means that we might be able to arrive at more confident and comprehensible analysis at the lower level than the higher level. Features of indeterminacy, contingency, and complexity might mean that we can't expect to have strong and empirically well supported analyses of ensembles like cities or trading systems. And we might find that studies of individual-level social behavior are more tractable and empirically defensible.

The explanatory scope consideration cuts in the opposite direction. We would like to be able to explain processes like urbanization, ethnic conflict, and the social role of religion. These processes are very interesting, and they are consequential as well. So we would like to have some reliable hypotheses about some of the causal dynamics that animate them. And studies that focus on individual-level processes may not shed much light on these higher-level processes.

So tractability perhaps pushes us towards the lower level, while an appetite for explanatory scope pushes us towards theorizing and investigating higher levels.

There is something appealing about a definition of the social sciences that tries to answer the actor-level kind of question: what are the drivers of real social behavior, in a variety of settings? What are the springs of individual action? How do environment and experience influence people's actions? This approach would fall within the sociological theory of the actor; it would largely overlap with social and developmental psychology, with a scoop of ethnomethodology on the side.

And this approach wouldn't be wholly limited to the individual. Some of the learning we do about cooperation, aggression, and social cognition might well provide a basis for explanation of high-level social phenomena such as ethnic conflict or the spread of agricultural practices.

But it also seems credible that we can learn some important things about the higher-level processes and structures as well. Political scientists have some robust ideas about how institutions work. Economists have succeeded in identifying some of the dynamics of trading systems and technology change. Urban sociologists are able to discern some of the processes of neighborhood transformation. So it is clear that there are higher level social processes, structures, and systems that are amenable to empirical and theoretical study.

A nice conjunction of research projects that illustrate this point can be found in the study of modern cities. Al Young (The Minds of Marginalized Black Men: Making Sense of Mobility, Opportunity, and Future Life Chances) and Loic Wacquant (Urban Outcasts: A Comparative Sociology of Advanced Marginality) provide ground-level studies of the actors who make up the inner city. Robert Sampson's Great American City: Chicago and the Enduring Neighborhood Effect offers a meso-level account of how neighborhoods work, and some of the causal relations that can be discerned at the level of the neighborhood. Thomas Hughes' Networks of Power: Electrification in Western Society, 1880-1930 demonstrates how a major technology like electric power is both structured and structuring within the urban systems in which it is introduced. And Saskia Sassen's The Global City: New York, London, Tokyo provides an account of systematic interrelations among cities in a global network. Each of these studies sheds light on how cities work; they do so at different levels of granularity; and each study brings with it an admirable degree of empirical and theoretical rigor. Each of them tells us something novel and non-trivial about how cities function. (There are prior postings on each of these authors: Young, Wacquant, Hughes, Sassen.)

This suggests something pretty moderate and pluralistic: that there is valid and important social research to be done at many levels of social organization. We won't find a unifying science of everything. But we can do social science research at many levels in ways that respect the heterogeneity of the social world while also shedding light on the workings of some important social and causal processes. There is no privileged level of research to which we should limit our social-science gaze.

What is critical about "critical realism"?

Critical realism is an approach to the philosophy of social science advocated centrally by Roy Bhaskar. Other contributors include Margaret Archer and Andrew Collier. What, precisely, does this phrase mean?

The "realism" part of the label is fairly straightforward. Bhaskar maintains that the social sciences (sometimes, often, once in a while) succeed in discovering and describing the real properties and causal powers of social structures and systems. Social entities have real causal powers, and sociology can discover the details of these powers. The approach is anti-positivist, anti-covering-law, and anti-reductionist.

So far this is the familiar position of scientific realism, applied to the social sciences. Rom Harré laid out a version of this in his causal realism theory (Causal Powers: Theory of Natural Necessity). If there is a controversial part of the theory, it is the attribution of reality to higher-level social structures like states, modes of production, and classes; but this isn't in fact very controversial.

This realist theme about knowledge of the social world is also familiar from the "causal mechanisms" approach to social explanation, where theorists argue that there are real (though often unobservable) social causal mechanisms that constitute the motive force of social change.

The more difficult problem is to say what "critical" means in this context. And surprisingly, neither Bhaskar nor his circle is very explicit about this question. The idea of "critical" realism does not appear at all in Bhaskar's first major book, A Realist Theory of Science (1975). The idea of critical philosophy is important and prominent in his second book, The Possibility of Naturalism: A philosophical critique of the contemporary human sciences (1979). But it isn't used to qualify "realism" but rather "naturalism."

Here is how Bhaskar introduces the idea of critical naturalism in the preface to the first edition of PN:

The upshot of the analysis is a new critical naturalism, entailing a transformational model of social activity and a causal theory of mind. The transformational model necessitates a relational conception of the subject-matter of sociology and a series of ontological, epistemological and relational limits on (or conditions for) a naturalistic science of society." (kl 145)

When Bhaskar comes to qualify the "realism" of RTS later in his work, he uses the phrase "transcendental realism" to describe this formulation of his theory. The idea of "transcendental realism" is derived from Kant's Critique of Pure Reason, where a transcendental argument is introduced as one that seeks the conditions of the possibility of a certain kind of knowledge. What must be true of the social world and social actors in order that they may constitute the object of empirical knowledge? Bhaskar's specific question is this:

To what extent can society be studied in the same way as nature? (kl 180)

Bhaskar and Tony Lawson explain this transcendental terminology in Critical Realism: Essential Readings:

Bhaskar sustains a metaphysical realism as a way of elaborating an account of what the world 'must' be like for those scientific practices accepted ex posteriori as successful, to have been possible. (3)

This all gives a strong clue to the reader that Bhaskar's intentions are philosophical and ontological from the start; he deliberately chooses to adopt the language of Kant's critical philosophy of knowledge for his own study of the social sciences.

So, again, what might be implied by attaching "critical" to "realism"?

Critical thinking as emancipatory. In the Marxist tradition the word "critical" has a fairly specific meaning. This meaning is reflected in Marx's eleventh thesis on Feuerbach. "The philosophers have sought to understand the world; the point, however, is to change it." Critical science is engaged science, committed science, emancipatory science. Critical science is committed to constructing bodies of knowledge that have substantial impact on the link long term best interests of humanity.

Critique as illusion-destroying. Another dimension of the idea of criticism in the Marxist tradition is the idea of "critique" -- focused intellectual effort to uncover the implicit (and misleading) assumptions of various schemes of thought and policy. Marx's Capital is subtitled "A Critique of Political Economy", and this phrase is found in many other of his titles as well. This brings in the idea of laying bare the implicit (often dominating) assumptions of various systems of thought. Laying bare the partisan assumptions underlying ideology and false consciousness is an exercise of critique.

Critique as self-creation. Finally, there is a third connotation of "critical" that pertains to its use in the social sciences: the constant reminder that the social world is not independent and separate from "us". This involves the feature of "reflexiveness" that obtains in the social world. We constitute the social world, for better or worse. And the forms of knowing that we gain through the social sciences also give rise to forms of creating of new social forms -- again, for better or worse. So it is crucial to pay attention to the plasticity of the social relations in which we live, and the innovations we create in those relations through our own processes of knowing and doing. Margaret Archer refers to this fundamental aspect of the relationship between actors and the social world as "morphogenesis" (link).

I think each of these elements is involved in Bhaskar's evolving conception of "criticial" philosophy. In the preface to the Second Edition of PN Bhaskar makes most of the points highlighted above. He refers to the importance of critique of "philosophical ideologies," including positivism, where critique is understood in roughly the sense mentioned above. (An intended second volume of PN was planned but not completed, which would have been called Philosophical Ideologies.) And in the Preface to the First Edition of PN he makes reference to ideas originally expressed in the eleventh thesis on Feuerbach mentioned above, but this time quoted in Capital (part iv, section 10, p. 505), in explaining why sociology is important to epistemology: "Sociology is necessary if we are to avoid 'that kind of criticism which knows how to judge and condemn the present, but not how to comprehend it'" (kl 136).

The emancipatory character of Bhaskar's conception of the social sciences emerges as well in his critiques of the fact-value dichotomy in science. He rejects the idea that the scientist must remain ethically neutral with respect to the social and historical processes he or she studies.

But none of this amounts to a systematic exposition of what "critical" philosophy is. At most it gives the reader some clues about the features of thinking, reasoning, and acting that Bhaskar seems to have in mind when he advocates for critical realism as an approach to the philosophy of sociology.

So it seems that Bhaskar has chosen to allow connotation to replace analysis when it comes to explaining "critical". He is a careful and explicit philosopher in much of his writing; but on the subject of "critical" method, he is surprisingly elliptical. And to me, this suggests that the import of Bhaskar's system is more on the side of "realism" than its "critical" methodology.

New thinking about social systems

There is a great deal of important international work underway today within the philosophy of social science on the general topic of social ontology. How do social structures relate to the actions of socially situated actors? How does causation work in the social realm? Can we say anything rigorous about the nature of "levels" of the social world -- micro, meso, and macro? And is there such a thing as an "emergent" social property or entity?

Sociologists and philosophers in Germany, Scandinavia, the UK, Belgium, France, Italy, and North America have undertaken serious work on these topics, and they constitute a dynamic network of thinking and debating. Some of the longstanding dualities in philosophy and sociology are questioned: individualism versus holism, micro versus macro, analytic versus continental, structure versus agent. Sociologists whose dispositions incline towards the importance of social structures are convening with rational choice theorists and game theorists; analytic sociologists are debating ontology with emergentists; and the field is displaying an energetic and productive degree of ferment.

The people whose work I am thinking of here are a motley group: Peter Hedstrom, Hans Joas, Petri Ylikoski, Bert Leuridan, Margaret Archer, Gianluca Manzo, Philippo Barbera, Pierre Demeulenaere, Julian Reiss, Rainer Greshoff, Dave Elder-Vass, Jeroen Van Bouwel, Mohamed Cherkaoui, ... And it is roughly as challenging to keep clearly in mind the manifold debates that are unfolding as it is to watch the Indianapolis 500 as the cars rocket by at 200 miles an hour. Some of these contributors are long-established scholars with huge reputations; others are young scholars with wickedly sharp minds and awesome work habits. And frankly, I'm at least as impressed with the younger generation as the elder.

One recent book that stands out as a key contribution that permits a degree of geolocation within these tangled debates is Poe Wan's Reframing the Social: Emergentist Systemism and Social Theory. Wan seems to have read every word of the debates, and he is ready to help interested parties take stock of the various theoretical perspectives.

The key axis in Wan's work -- here and elsewhere -- is that defined by Niklas Luhmann and Mario Bunge on the topic of emergent social systems. Wan is persuaded that social properties are "emergent" in some important sense, and he also seems to believe that the ideas of system and complexity are important components of our vocabulary for social ontology. But how should we understand these ideas? Luhmann's theory tends towards the position of holism, whereas Bunge's position allows that there is an intelligible connection between upper-level properties and micro-level facts and he focuses his theory of explanation on finding underlying mechanisms of various social outcomes. Wan refers to Bunge's approach as "rational emergentism" (68). Wan is respectful towards each of these theories, but he clearly favors that put forward by Bunge. Like Bunge, Wan too favors the focus on mechanisms; he admires Bunge's insistence on paying attention to the details of existing research in the natural and social sciences; and most importantly, he endorses Bunge's view that our theories of "emergent" social phenomena must be grounded in a theory of the actor.

Here is how Wan characterizes Bunge's systems theory and its relationship to a theory of the actor:

Bunge's emergentist systemism is best construed as a version of action-systems theory ..., because Bunge states explicitly that "the features of a social system depend upon the nature, strength, and variability of social relations, which in turn are reducible to social actions." (6)

And Wan believes that Bunge's approach provides a robust way of conceptualizing the nature of the social realm:

In Chapter 5 I argue for a systems approach that is ontologically sound (that is to say, transcending both holism [macro-reductionism] and individualism [micro-reductionism]), with due consideration given to the role of human factors and their actions in designing, maintaining, improving, repairing or dismantling social systems. (10)

Wan also believes that Bunge's CESM model is a helpful one for thinking about social ontology and explanation. This model incorporates composition, environment, structure, and mechanisms. For a given social entity we want to know what it is composed of; what are the features of the environment within which it functions; how is it arranged internally; and how does it work (55).

Another important part of Wan's approach is his affinity with the social theories of the critical realists -- Bhaskar, Asher, Elder-Vass. Fundamentally this comes down to the view that social structures have real causal powers, along the lines of Rom Harre's meaning of this term (110, 119, 121).

Reframing the Social is an important contribution to current debates about the nature of the social. And I agree with him that the question of social ontology is a fundamental one; perhaps more so than the issues of the epistemology of the social sciences that have generally played first violin. Further, Wan does a good job of showing how these debates are relevant to the emerging framework of analytical sociology -- sometimes in ways that cast doubt on some of the guiding presuppositions of that field. In particular, the aggregative strategy of explanation that is favored by AS is questionable once we give credence to the idea that social structures possess autonomous causal powers. Along with Dave Elder-Vass's The Causal Power of Social Structures: Emergence, Structure and Agency, this book stands as an important alternative to Hedstrom's Dissecting the Social: On the Principles of Analytical Sociology.

Here is a nice passage from Durkheim's Rules of Sociological Method that Wan quotes on the subject of emergence:

Supervenience of the social?

I have found it appealing to try to think of the macro-micro relation in terms of the idea of supervenience (link). Supervenience is a concept that was developed in the context of physicalism and psychology, as a way of specifying a non-reductionist but still constraining relationship between psychological properties and physical states of the brain. Physicalism and ontological individualism are both ontological theories about the relationship between higher and lower levels of entities in several different domains. But neither doctrine dictates how explanations in these domains need to proceed; i.e., neither forces us to be reductionist in either psychology or sociology.

The supervenience relation holds that --

X supervenes on Y =df no difference in X without some difference in the states of Y

Analogously, to say that the "social" supervenes upon "the totality of individuals making up a social arrangement" seems to have a superficial plausibility, without requiring that we attempt to reduce the social characteristics to ensembles of facts about individuals.

I'm no longer so sure that this is a helpful move, however, for the purposes of the macro-micro relationship. Suppose we are considering a statement along these lines:

The causal properties of organization X supervene on the states of the individuals who make up X and who interact with X.

There seem to be quite a few problems that arise when we try to make use of this idea.

(a) First, what are we thinking of when we specify "the states of the individuals"? Is it all characteristics, known and unknown? Or is it a specific list of characteristics? If it is all characteristics of the individual, including as-yet unknown characteristics, then the supervenience relation is impossible to apply in practice. We would never know whether two substrate populations were identical all the way down. This represents a kind of "twin-earth" thought experiment that doesn't shed light on real sociological questions.

In the psychology-neurophysiology examples out of which supervenience theory originated these problems don't seem so troubling. First, we think we know which properties of nerve cells are relevant to their functioning: electrical properties and network connections. So our supervenience claim for psychological states is more narrow:

The causal properties of a psychological process supervene on the functional properties of the states of the nerve cells of the corresponding brain.

The nerve cells may differ in other ways that are irrelevant to the psychological processes at the higher level: they may be a little larger or smaller, they may have a slightly different content of trace metals, they may be of different ages. But our physicalist claim is generally more refined than this; it ignores these "irrelevant" differences across cells and specifies identity among the key functional characteristics of the cells. Put this way, the supervenience claim is an empirical theory; it says that electrical properties and network connections are causally relevant to psychological processes, but cell mass and cell age are not (within broad parameters).

(b) Second and relatedly, there are always some differences between two groups of people, no matter how similar; and if the two groups are different in the slightest degree -- say, one member likes ice cream and the corresponding other does not -- then the supervenience relation says nothing about the causal properties of X. The organizational features may be as widely divergent as could be imagined; supervenience is silent about the delta to epsilon relations from substrate to higher level. It specifies only that identical substrates produce identical higher level properties. More useful would be something like the continuity concept in calculus to apply here: small deviations in lower-level properties result in small deviations in higher-level properties. But it is not clear that this is true in the social case.

(c) Also problematic for the properties of social structures is an issue that depends upon the idea of path dependence. Let's say that we are working with the idea that a currently existing institution depends for its workings (its properties) on the individuals who make it up at present. And suppose that the institution has emerged through a fifty-year process of incremental change, while populated at each step by approximately similar individuals. The well-established fact of path dependence in the evolution of institutions (Thelen, How Institutions Evolve: The Political Economy of Skills in Germany, Britain, the United States, and Japan) entails that the properties of the institution today are not uniquely determined by the features of the individuals currently involved in the institution in its various stages. Rather there were shaping events that pushed the evolution of the institution in this direction or that at various points in time. This means that the current properties of the institution are not best explained by the current properties of the substrate individuals at present, but rather by the history of development that led this population to this point.

It will still be true that the workings of the institution at present are dependent on the features of the individuals at present; but the path-dependency argument says that those individuals will have adjusted in small ways so as to embody the regulative system of the institution in its current form, without becoming fundamentally different kinds of individuals. Chiefly they will have internalized slightly different systems of rules that embody the current institution, and this is what gives the institution its characteristic mode of functioning in the present.

So explanation of the features of the institution in the present is not best couched in terms of the current characteristics of the individuals who make it up, but rather by an historical account of the path that led to this point (and the minute changes in individual beliefs and behaviors that went along with this).

These concerns make me less satisfied with the general idea of supervenience as a way of specifying the relation between social structures and substrate individuals. What would satisfy me more would be something like this:

Social structures supervene upon the states of individuals in the substrate described at a given level of granularity corresponding to our current theory of the actor.
Small differences in the substrate will produce only small differences in the social structure.

These add up to a strong claim; they entail that any organization with similar rules of behavior involving roughly similar actors (according to the terms of our best theory of the actor) will have roughly similar causal properties. And this in turn invites empirical investigation through comparative methods.

As for the path-dependency issue raised in comment (c), perhaps this is the best we can say: the substrate analysis of the behavior of the individuals tells us how the institution works, but the historical account of the path-dependent process through which the institution came to have the characteristics it currently has tells us why it works this way. And these are different kinds of explanations.

The heterogeneous social?

image: screenshot from video, "A Bird Ballet"

I've argued in several places that we need to think of the social world as being radically heterogeneous (link, link, link). There are multiple processes, things, structures, and temporalities at work, and what we perceive at a moment in time in the social world is a complex composite of these various things. The social world is not a unified system; it is not a manifestation of a unified underlying process; it is not a unity at all.

What does this claim about the social world mean in concrete terms? And what are the implications for the social sciences? Consider a few examples of complex social wholes:

the industrial revolution, 1700-1850
the rise of Al Qaeda, 1970-2001
urbanization in China, 1600-1700
Chicago as a functioning city, 2000
the University of Illinois, 1971
being Muslim in Toronto, 1990

These examples are themselves heterogeneous. Some are extended historical processes; others are synchronic sets of social facts; others are institutions and social environments at a time; yet others are states of social identities at a time. But the fact about heterogeneity that I want to focus on here is internal: for each social phenomenon, there are heterogeneous components and sub-processes that make it up and that generally have their own dynamics and properties.

First, where is the heterogeneity in these examples?

The industrial revolution is not one thing; it is rather a confluence of developments in technology, markets, habitation, ideology, labor practices, scientific institutions, natural resources, and numerous other social features that change over time. And the outcomes of "industrial revolution" are not uniform over regions, nations, sectors, or industries. Different parts of Britain had different experiences; and these experiences and outcomes are in turn different from those in Sweden or Italy.

Likewise, early-modern urbanization of Chinese cities is a the result of a complex ensemble of processes. We can summarize the outcome by a measure of the percentage of people living in cities greater than 100,000 at a certain moment in time. But the causes, processes, environmental factors, and institutions through which this transformation took place were highly diverse; and the cities that resulted were diverse as well. (G. William Skinner charts out much of this diversity in a number of works; The City in Late Imperial China.)

Or take Chicago in 2000. The social whole is a composite of population, institutions, political processes, demographic transitions, transportation networks, employment systems, and policing practices -- and many other factors I haven't mentioned. And if we were to ask a question along these lines -- why did Chicago come to function in 2000 in the fashion that it did? -- we would have to consider all of these processes and their composite effects, and their interactions with each other. There is no single answer to the question, "what is Chicago and how does it work?".

Being Muslim at a time and place is likewise deeply heterogeneous. Individuals, families, sub-groups, and institutions differ -- from Iowa to Ontario, and within communities and across mosques. Individuals differ in ways that are both personal and institutional. So there is no single identity that is "Muslim in Toronto"; rather, there is an ensemble of people, groups, and social organizations which in the composite represent "the many identities of Muslims in Toronto."

In fact, it seems to me that heterogeneity comes into each of these examples in a variety of ways. There are:

multiple causes at work
multiple expressions of ethnic / cultural identity
multiple purposes and understandings on the parts of participants
multiple sub-institutions with different profiles and dynamics
multiple outcomes or macro-characteristics that are denoted by the term

So the constitution and dynamics of social phenomena reflect diverse kinds of things and processes.

So where does "science" come into this picture? Is it possible to have a scientific understanding of a heterogeneous phenomenon?

Here is one possible strategy. We might hope that the sub-components of heterogeneous entities might have separable dynamics of development; so even though the city simpliciter does not have an inherent dynamic of development or functioning, its subsystems do. In this case we might say that a scientific analysis of the whole involves a separate scientific theory of the components and a synthetic effort to show how they interact.

But this approach is perhaps too generous to the power of analysis; it seems to presuppose that we can disassemble a complex and heterogeneous whole into a discrete set of reasonably homogeneous components, each of which can be treated scientifically and separately. The thesis above, though, was fairly comprehensive: "all social phenomena are heterogeneous". So that seems to imply that the results of analysis lead us to a set of components that are themselves heterogeneous -- a heterogeneity regress! And this paradoxical conclusion actually seems to be true in a very practical sense: when we disaggregate "Chicago" into "political institutions," "policing institutions," "economic institutions / market system", and the like -- we again encounter social units that have internal variation and heterogeneity.

Could we at least argue that analysis reduces complexity to a certain extent, and that the components are more amenable to scientific and causal theorizing than the whole? This more modest claim does seem to be defensible. Take the processes underlying "industrial revolution". It is possible to offer a reasonably rigorous study of the development of scientific knowledge and the institutions through which knowledge is created and disseminated, in ways that are less complex that the whole with which we began. Likewise, we can offer specialized study of the "making of the English working class" that includes some of the factors that influenced labor and politics during the period -- thereby making a contribution to a better understanding of the complex whole, industrial revolution.

In an odd way this line of thought seems to bring us back to one of the oldest debates in the history of philosophy going back to the pre-Socratic philosophers: does "nature" have a "nature"? The atomists believed that the complexity of the observed world depended ultimately on the simple properties of the components; whereas philosophers like Heraclitus maintained that nature consisted of "flux" all the way down.

(The video mentioned at the top, "A Bird Ballet," is beautiful and surprising. But I'm not certain that it fully illustrates the point I'm making about the social world. The ensemble of starlings depicted here shows a startling reality of shifting shapes and motions over time. The viewer is led to ask, how did this ensemble of thousands of organisms come to have this graceful and shifting dynamic?" So far it is a good analogy to the social. But an animal behavior specialist is likely to be able to give us a pretty simple explanation of how the individual-level flight behavior interacts across birds in flight, and results in the swarming behavior documented here. In this respect the swarm is simpler than the "heterogeneity all the way down" picture that I'm putting forward for complex social phenomena. Still, it is a powerful example of "wholes" that are less unified than they first appear.)

Social embeddedness and methodological localism

Methodological localism emphasizes two ways in which actors are socially embedded. Actors are socially situated and socially constituted.

Socially situated. In any given situation individuals are embedded within a set of social relations and institutions that create opportunities and costs for them. They have friends and enemies, they have bosses and workers, they have neighbors and co-religionists, they have families. All of these relations and institutions serve to constitute the environment within which they make plans and perform their actions. This complex setting of opportunities and regulative systems falls at the center of research for the new institutionalism (Brinton and Nee 1998, The New Institutionalism in Sociology; Ostrom 1990, Governing the Commons: The Evolution of Institutions for Collective Action ). And, as the new institutionalists rightly insist, there are very important variations across social space in the details of the workings of institutions and social networks. Two adjacent California counties may have slightly different rules of livestock liability; and these rule differences will lead to different patterns of behavior by ranchers (Ellickson 1986, "Of Coase and Cattle"; link). We might call this the "structure" factor.

Socially constituted. The second form of social embeddedness is deeper and more persistent. The individual’s values, commitments, emotions, social ideals, repertoires of action, scripts of behavior, and ways of conceiving of the world are themselves the products of a lifetime of local social experiences. Individuals are socialized throughout their childhoods and adult lives into specific ways of thinking and acting, and the mosaic of these experiences serves to constitute the moral, emotional, and practical characteristics of the individual’s social-cognitive system. The way the individual thinks about the social world is itself a feature of his/her social setting. Moreover, the mechanisms of socialization—schools, religious institutions, military experience, playgrounds, families—are themselves concrete social phenomena that are amenable to empirical sociological investigation, and they too are locally embodied. If we want to know why affluent Pakistani teenagers applauded on Facebook the murder of Punjab Governor Salmaan Taseer for his opposition to harsh blasphemy laws, then we need to look in detail at the ways in which the political and religious attitudes of this segment of Pakistani society took shape (link). We might call this the "identity" factor.

These two aspects of embeddedness provide the foundation for rather different kinds of social explanation and inquiry. The first aspect of social embeddedness is entirely compatible with a neutral and universal theory of the agent -- including rational choice theory in all its variants. The actor is assumed to be configured in the same way in all social contexts; what differs is the environment of constraint and opportunity that he or she confronts. This is in fact the approach taken by most scholars in the paradigm of the new institutionalism, it is the framework offered by James Coleman in Foundations of Social Theory, and it is also compatible with what analytical sociologists refer to as "structural individualism". It also supports the model of "aggregative" explanation -- explain an outcome as the result of the purposive actions of individuals responding to opportunities and constraints.

The second aspect, by contrast, assumes that human actors are to some important degree "plastic", and they take shape in different ways in different social settings. The developmental context -- the series of historically specific experiences the individual has as he/she develops personality and identity -- leads to important variations in personality and agency in different settings. So just knowing that the local social structure has a certain set of characteristics -- the specifics of a share-cropping regime, let us say -- doesn't allow us to infer how things will work out. We also need to know the features of identity, perception, motivation, and reasoning that characterize the local people before we can work out how they will process the features of the structure in which they find themselves. This insight suggests a research approach that drills down into the specific features of agency that are at work in a situation, and then try to determine how actors with these features will interact socially and collectively.

If both actors and structures differ substantially across social settings, then there are many possible pathways that interactions and processes can take. Suppose for the sake of example that there are three "types" of actors:

prudent, self-interested, calculating
strongly attached to religious duty
strongly attached to norms of social solidarity and perceived fairness

Now suppose we are interested in working out the likely consequences of a certain social situation -- let us say, a call for a workers' strike in resistance to the company's cutting wages. Mancur Olson's classic argument in The Logic of Collective Action considers exactly this kind of case based on assumptions reflecting the first form of agency, and he concludes that collective action will fail. So the prediction is that the call for a strike will fail. But what if a significant number of workers are type-3 actors? In this case the likelihood of successful collective action is much greater, because organizers can demonstrate that all workers are better off if the group maintains solidarity, and that it is unfair to withhold support while others are coming forward. This argument doesn't affect the type-1 actor; but it does motivate the type-3 actor.

So what are the social circumstances in which we are likely to find type-3 actors rather than type-1 actors? It has sometimes been maintained that miners have a high degree of solidarity relative to other groups of workers. They have shared circumstances of risk underground that make them viscerally aware of their dependence on the support of others; they have a culture of songs and stories that favor solidarity; and they have a history of successful collective action. So the argument is that children who grow up in this environment confront a converging set of influences that amplify the "solidarity" functions of agency and damp down the "me-first" functions. Miner communities, then, cultivate a particular kind of actors. (A book that tries to do a more careful job of evaluating miners' solidarity is Roy Church and Quentin Outram, Strikes and Solidarity: Coalfield Conflict in Britain, 1889-1966, reviewed here.)

Rationale for the philosophy of social science

The philosophy of social science is one of the smaller sub-disciplines of philosophy, and many universities have only a single course in the subject. In contrast to larger fields such as ethics or epistemology, the philosophy of social science involves a much smaller part of the intellectual spectrum and audience within the field of philosophy. So how is this discipline defined by scholars who help to constitute it today? What are some of the intellectual challenges that drive the field forward?

One way to proceed in trying to answer these questions is to take a quick look at the descriptions of the field offered by other philosophers in recent books and collections. There has been a lot of activity in the field in the past decade, so we have a lot to work with.

Here is how Daniel Steel and Francesco Guala motivate the subject in their 2011 collection, The Philosophy of Social Science Reader:

Social science studies topics -- such as economic growth, employment, crime, social inequality, cultural conflicts, and so on -- matter to almost everyone, and philosophy is important to social science. For example, there is little consensus across the social sciences as to basic methods, aims, and fundamental assumptions about human beings, and disputes on such topics are inevitably linked to long-standing discussions in philosophy. The answer to the second question [why to publish a new anthology] is that the philosophy of social science has changed quite dramatically over the last two decades. So a new anthology is required to keep track of the best research, and to map the moving boundaries of this important subfield of philosophy. (1)

They single out four major themes to characterize how the field has moved in the past two decades -- disunity, interdisciplinary, naturalism, and values. And their volume is organized around seven parts: "Values and social science," "Causal inference and explanation," "Interpretation," "Rationality and choice," "Methodological individualism," "Norms, conventions, and institutions," and "Cultural evolution."

In his more specialized collection in 2009, Philosophy of the Social Sciences: Philosophical Theory and Scientific Practice, Chris Mantzavinos offers this description of the field:

Philosophy of science examines "scientific knowledge." It tries to illuminate the specific characteristics of science, the way it is produced, the historical dimensions of science, and the normative criteria at play in appraising science…. The philosophy of the social sciences, on the other hand, traditional deals with such problems as the role of understanding (Verstehen) in apprehending social phenomena, the status of rational choice theory, the role of experiments in the social sciences, the logical status of game theory, as well as whether there are genuine laws of social phenomena or rather social mechanisms to be discovered, the historicity of the social processes, etc. (1)

Mantzavinos's organizing topics include: "Basic problems of sociality," "Laws and explanation in the social sciences," and "How philosophy and the social sciences can enrich each other." The first has to do with social ontology; the second has to do with explanation; and the third has to do with cross-over problems that affect both social science and philosophy (cooperation; virtuous behavior; and the hermeneutic circle).

Mantzavinos correctly emphasizes the importance of linking the work of philosophers on these topics to the practices of working social scientists:

The enterprise is motivated by the view that the philosophy of the social sciences cannot ignore the specific scientific practices according to which scientific work is being conducted in the social sciences and will only be valuable if it evolves in constant interaction with the theoretical developments in the social sciences. (1)

Ian Jarvie and Jesus Zamora-Bonilla's very interesting 2011 collection, The SAGE Handbook of the Philosophy of Social Sciences, is another important recent compendium in the philosophy of social science. Jarvie describes the discipline in more encompassing terms in his introduction to the volume:

As a set of problems, the philosophy of the social sciences is wide-ranging, untidy, inter-disciplinary and constantly being reconfigured in response to new problems thrown up by developments in the social sciences; in short, disorderly. As an institutionalised discipline, by contrast, philosophy of the social sciences emerged from the academic division of labour that fosters specialization and professionalization, that is, order and discipline. There is always a tension between the unruliness of intellectual inquiry and the urge towards order and discipline. (1)

Jarvie offers some interesting observations of the historical path that the discipline of the philosophy of social science has taken. He provides a perspective that is intermediate between philosophy and sociology of science; he is interested in characterizing both the intellectual influences that pushed the field and some of the institutions within which the field developed.

The field is already big enough to be fragmented. Three rough divisions would be: literatures deriving from economics and politics; from psychology and from sociology, anthropology and history. Those interested in economics lay much stress on testability, methodological individualism and rational choice. By contrast the latter group is much given to discussing causation in history and society, the nature of social wholes, problems of meaning and the social construction of reality. Psychology is an area where much of the discussion we might think of as philosophy of the social sciences is carried out in the pages of its own journals. (5)

Jarvie also provides a very interesting table of topics are they are represented in anthologies in the philosophy of social science from 1953 to 2007 (Table 1.2).

It is interesting to observe that scholars in other fields have tried to understand the history of the logic of the social sciences in quite different terms than those adopted by philosophers. George Steinmetz is a good example. He is a highly talented and innovative historical sociologist who has made very important contributions to issues about methodology and theory within sociology. (A particularly interesting journal article is "Odious Comparisons: Incommensurability, the Case Study, and 'Small N's' in Sociology"; link.) Steinmetz's 2005 collection on the influence of positivism in the social sciences, The Politics of Method in the Human Sciences: Positivism and Its Epistemological Others, is well worth considering as a sociologist's version of the philosophy of social science anthologies considered above. And the gist of the volume and the methodologies pursued are quite different. Here is Steinmetz's description of the purpose of the volume:

This collection explores the vicissitudes of positivism and its epistemological others in the contemporary human sciences. The volume's overarching goal is to provide a mapping of the contemporary human sciences from the standpoint of their explicit and especially their implicit epistemologies, asking about the differences and similarities among and within these disciplines' epistemological cultures. Taken together, the essays provide a portrait of epistemology and methodology (writ large) in the contemporary social sciences. (1)

So the volume is about the logic and epistemology of the social sciences. But it is also about the concrete sociological history of the development of the social science disciplines:

This book also offers the rudiments of a comparative historical narrative of these disciplinary developments since the beginning of the twentieth century, with an emphasis on the period beginning with World War II. Recent writing has pointed not just to the present-day conjuncture of epistemological uncertainty but also to the middle decades of the twentieth century as critical moments in the transformation of the social sciences' deep culture. (1)

There is also a substantive final goal for the volume: to lay the conceptual space necessary to conceive of alternatives to the positivist framework for social science theorizing.

The other overarching aim of this book is to survey the landscape of alternatives to positivism in the human sciences. (2)

The volume provides a number of interesting perspectives by very talented sociologists on these topics, including especially contributions by Margaret Somers, William Sewell, Andrew Collier, Michael Burawoy, Sandra Harding, and Andrew Abbott. Each of them turns our attention a quarter-turn away from highly abstract conceptual research, to a nuanced effort to understand how ideas, institutions, and frameworks have played out in the social sciences in the past fifty years.

Here is my own statement of why the philosophy of social science is important, included in my contribution "Philosophy of Sociology" in Fritz Allhoff's 2010 collection, Philosophies of the Sciences: A Guide:

The importance of the philosophy of social science derives from two things: first, the urgency and complexity of the challenges posed by the poorly understood social processes that surround us in twenty-first century society, and second, the unsettled status of our current understanding of the logic of social science knowledge and explanation. We need the best possible research and explanation to be conducted in the social sciences, and current social science inquiry falls short. We need a better-grounded understanding of the social, political, and behavioral phenomena that make up the modern social world. Moreover, the goals and primary characteristics of a successful social science are still only partially understood. What do we want from the social sciences? And how can we best achieve these cognitive and practical goals? There are large and unresolved philosophical questions about the logic of social science knowledge and theory on the basis of which to arrive at that understanding. And philosophy can help articulate better answers to these questions. So philosophy can play an important role in the development of the next generation of social science disciplines. (295)

It appears that there are important differences in the approaches to understanding the role of the philosophy of social science contained in these various passages. There is a fairly traditional "history of thought" approach, that attempts to document the way in which this sub-discipline took shape within the larger discipline of philosophy. There is a "sociology of ideas" approach that tries to link certain research traditions to a given set of research institutions and larger social priorities. And there is a philosophical approach that tries to work out what questions really are most pressing when we consider the challenges of the social sciences.

(The graphics at the top illustrate several very different ways of trying to make sense of a disparate set of related academic disciplines.)