Some thoughts on (Collective) Ethnographic Practice

amelia · September 6, 2018, 5:17am

https://www.quantinsti.com/wp-content/uploads/2018/02/Training-Neural-Networks-For-Stock-Price-Prediction_1.jpg

I have been giving a lot of thought over the course of the last few years to the question of what it means to be an “ethnographer” on the kinds of projects that we do.

Ethnography is much more than qualitative coding. It requires, most fundamentally, participant-observation. This is why I’ve theorised before that the Edgeryders team actually does ethnography in a collective, distributed fashion: through community management + qualitative coding + qualitative data analysis. And as I think about this more, it means a kind of poetic extension of the notion of collective intelligence that is so central to Edgeryders work. From my perspective, we are not only mapping collective intelligence through ethnographic practice: we are using it to undertake that ethnographic practice as well.

In a sense this means that it may not be the worst idea to have people doing both community management and coding at the same time, as doing both could lead to a more robust total ‘ethnographer’ role. At the same time, and as I’m sure @noemi and @johncoate would attest, the role of a community manager is actually above and beyond that of an ethnographer in terms of participation — they aren’t just tasked with understanding the community, but also with creating the conditions possible for the community to function in a healthy way. This is ultimately why I think the roles should remain separate, while also closely connected (just as in Open Care).

Instead (and above and beyond) I think it will be important for our ethnographic coding team to be deeply immersed – interacting with posts and even, if they feel so inclined, contributing their own content to the platform. I got a lot out of the post I made in Open Care: it made me understand the vulnerability one has to feel to post one’s experiences, often intimate, and what it felt like to interact back and forth with other community members. This kind of insight is invaluable to an ethnographer trying to understand lived experience. I’d like to participate more this time around, through commenting and engaging in conversation with community members.

By definition, the kind of qualitative coding we do on the platform requires in-depth engagement with community members’ stories: we aren’t just coding our own field notes, we are directly coding the interactive contributions of community members, aggregating and theorising as we go to try to map the collective intelligence of the community. This, to me, is the most engaged kind of coding I’ve ever done, because a) it isn’t just coding one person’s contributions, whether in interview form or in the form of my own field notes and b) people can actually see and interact with my codes! It was very cool to be talking with @alex_levene and @johncoate (among many other generous community members who gave us their time) at Open Village not only around the ethnographic findings, but around the actual process of coding itself. Alex got to see the codes associated to his post and give feedback, see how it was visualised (with the help of @jason_vallet and @alberto) , and give coding a go himself, all of which helped me get an even more detailed and nuanced understanding.

In short – collective ethnography for collective intelligence has a methodological resonance to me, and I’m looking forward to exploring and furthering it even more.

alberto · September 6, 2018, 8:05am

This is really important, @amelia. It confirms an intuition I have had for a long time, but have never had the scientific background to validate: harnessing collective intelligence works best if you are yourself part of the collective brain. In Edgeryders, I do not behave as a referee, but as one of the players; I am not solely concerned with the conversation being factful and polite (though that’s important), I am also passionate about the values it upholds, and the hypotheses and conclusions it produces. I engage with some conversations much more deeply than with others. I cannot be “neutral” in that sense. Maybe @johncoate and @noemi can, I am not sure.

Your suggestion also speaks to the ethics of eating your own dog food, central to Edgeryders and to everything I have done in the past, from music to other online communities.

Would this be something to add to further versions of the Field Methods paper?

amelia · September 6, 2018, 9:16am

Yes, worth adding.

And you are in a way articulating one of the key methodological contributions of ethnography: there is no ‘outside’, so the more in it you are, the better equipped you are to articulate what’s going on (as long as you are being very deliberate and reflexive about your positionality, and declaring it to those you’re interacting with). You theorise with people, not about them.

alberto · September 6, 2018, 9:24am

This is so counterintutive to a scientist.

But liberating. In fact, I am skeptical that we can “do science” in the sort of issues our clients (and we) care about.

Here is a statement. What do you think about it, @amelia?

In an online community, participant observation comes relatively easy and is a relatively natural stance. With few cues to mark an outsider from insiders, the researcher is normally perceived as just another user, and blends in with the community with little effort.

amelia · September 6, 2018, 9:36am

There’s a lot to unpack in that statement, actually.

First of all, in an online community it’s relatively easy to lurk. So the participant-observer could find themselves doing a lot of observing without participating. Less easy to lurk offline, where your physical presence is harder to get around, hide, etc. That being said, this means that access comes easier in online communities, indeed making participant-observation easier, if you do it right (e.g. actually participate).

Additionally, there’s an ethical dimension that comes with being perceived as ‘just another user’. In anthropology, you have an ethical obligation as a researcher to disclose your researcher status (I clarify the discipline because some researchers in sociology ‘go undercover’ to do research. It’s a subject of ethical debate. Personally I think disclosure is the ethical practice here).

alberto · September 6, 2018, 1:49pm

What is disclosure in an online context? You can write a post about it, but you have no certainty that people have read it. You can write it in your profile, but again people might not visit your profile page. Even if you call your user “HELLOIAMANANTHROPOLOGIST” people have an amazing ability to ignore relevant info.

On the other hand, Edgeryders is clearly characterized as a space for research and collective intelligence, and people need to go through an ethical consent funnel before they post. So, in a way is like being in the virtual version of an anthropology department, but not necessarily having a clear perception of who are the informants, who are the researchers, and who is just passing buy to deliver pizzas.

johncoate · September 6, 2018, 4:45pm

But ethnography, as I understand it from conversing with Amelia as well as my observations here over the years, is a fairly subjective practice with much leeway for selection and interpretation. Disclosure won’t be seen by everyone but will be seen by some. And to those who know the ethnographer is both participating and doing the evaluating, it could raise questions and even provoke discussion about the work, since it will be the words of those people who become the basis of the coding work. That’s an interesting situation…people comment knowing they are being evaluated. Could make them say things that are thought through better. But could also cause a certain amount of intrusion into the ethnography work by engaging the ethnographer/participant into discussions about how the work gets done. (And maybe that’s a good thing too…but it is all time consuming.)

And I think it raises another ethical issue as to whether the ethnographer participates but shouldn’t try to influence the conversation, either consciously or subconsciously, based on whatever biases that person brings to the conversation.

johncoate · September 6, 2018, 4:48pm

I don’t think getting paid or not has to enter into it, if the person doing it has the right attitude. At the WELL I got paid for being present with everyone, but I was a full participant and that was seen as not just ok, but a part of the value.

amelia · September 7, 2018, 5:05am

In my view, the ethical consent funnel is like the process I go through when I’m interviewing people – I send a participant information sheet explaining the research and the protections involved, get their consent, and continue. But again this is one of those ethical questions that’s ongoing even in offline research: when you’re doing participant-observation, you’re not going to be running around the city with a sign taped to your back saying I’M AN ETHNOGRAPHIC RESEARCHER. But when you’re doing an interview with someone where you could be getting information that, should they be identifiable, could potentially harm them, your ethical commitments are different. More explicit disclosure is necessary.

Most of the time, the ethical procedures required by the university are both too stringent and not stringent enough. Better to have a more robust ethical code that you can assess case by case rather than blanket DO THIS/DON’T DO THAT rules — it’s relatively easy to know when something might be harmful, whether it’s out on an online forum or said over a drink. Use your human being common sense, and err on the side of protection. The point is that you as a researcher have ethical commitments, in my view, and those commitments shouldn’t be hidden. They should be easily discoverable (whether in post form, through an ethical consent funnel, whatever). And above all you shouldn’t be using deception unless you have an extremely compelling reason to do so (not just, ‘people are more likely to talk to me if they don’t know I’m a researcher’).

The trick here, I think, is to a) realise that there is no contribution free of ‘bias’ (e.g. there is no view from nowhere-- every intervention you make comes from a place). The goal is to be as reflexive as possible about what that place is. And part of the goal of social research is to tease out things that are of interest to the researcher (whether that researcher is doing, say, an independent project on smart cities, or is doing a H2020 project on populism). For example, it seems clearly beneficial to the research to stop the conversation from becoming about daisy-growing — but where the skill comes in is being trained enough to know when the conversation about daisy-growing actually IS about populism.

F/e, in Open Care we’d get some really generic posts about happiness. A CM like Noemi might step in and say-- hey, could you tell us specifically what you do? Who you do it with? What kinds of futures you’d like to see? In this way she’s acting like an ethnographer— teasing out the specifics and asking people to think more deeply. She’s guiding the conversation, but she’s still asking for this person’s thoughts and opinions from their own perspective. She’s not saying “We think care is X, do you agree?” for example, because that really restricts the person’s ability to respond freely. There are a lot of nuances to it.

Later on in the research, though, it might actually be useful to test the social theory you’ve come up with (this is the part of ethnography that stresses not just theorising about people up in your tower, publishing it, and moving on to the next project). So in Open Care, I might ask everyone in the community: hey, I think from what you all have been collectively articulating in various forms, care is more about bringing people together than inventing a technological fix: I think you’re saying that people are the best technology. Is this accurate or am I coming out of left field? And then people can engage in meaningful conversation around that theory before you run off and publish it.

amelia · September 7, 2018, 5:12am

Payment is tricky because either option has its ethical pluses and minuses for both researchers and participants. If you pay people you’re not extracting free labour from them for your own gains as a researcher, so that’s a good. But it also means people aren’t participating ‘freely’ and, in research into lower income populations, could actually be exerting a form of coercion since that financial incentive is so meaningful. Also your data quality usually goes down because people are just participating for the money, and are often therefore interested in getting the task done as quickly as possible.

From the researcher side, my feeling is that most of the time researchers are getting paid for their research one way or another, either through a formal job agreement or through research funding (and PhD research is often somewhere in between). It’s rare that you have a researcher self-funding for the sake of knowledge, like the wealthy astronomers back in the day.

alberto · September 7, 2018, 7:52am

I very much resonate with this, @amelia. This is a better way to express what I called “being a player, not the referee”.

melancon · September 7, 2018, 9:30am

@amelia @alberto @johncoate This is a quite interesting thread. There is one aspect that you guys do not address, that I thought would pop up as I went down the page. When I read the post title “Collective Ethno Practice”, my first thought went onto the idea that you would be discussing what it means to collectively code conversations.
Well, you may ask why I indeed had this thought to begin with. The reason is, as a computer scientist helping you guys to deal with and make sense of the data, one concern I have is to identify approaches that can help ER’s methodology to scale.
And one approach wold be to have multiple people code “together”. That would inevitably bring those coders to share their codes. Which in turn would probably bring in additional work to make sure coders have a collectively coherent approach to coding, and a convergent approach to use codes, introduce new ones, modify existing codes, etc.
I am eager to hear your thoughts on this.

alex_levene · September 7, 2018, 12:22pm

Hi @melancon

Interesting that you mention that. It’s definitely a conversation i have heard a number of time in different ER contexts, especially as we discuss scalability and the next steps of the ethnographic work. It may be that we haven’t mentioned it in the discussions on this thread because we feel we’ve had some of those conversations already somewhere else on the platform. I will look through and see if i can find the threads where that idea has been opened up and discussed already. But when i think about it more i specifically remember conversations around ‘collective coding’ happening face2face at OpenVillage.

melancon · September 7, 2018, 12:26pm

Thanks @alex_levene ! I will keep an eye on the thread to hear about other places where I can grab ideas/experiences/opinions.

Best,

alex_levene · September 7, 2018, 1:22pm

there’s @alberto’s conversation on github from 2 years ago about the OpenEthnographer project: Architectural choice: decide whether codes should have an author or not · Issue #60 · edgeryders/openethnographer-for-drupal · GitHub
In that theres a discussion about “the vision of ethnography as a collaborative research methodology”

More recently theres the discussion in this thread Open Notebook Science for Ethnographic Coding: Open Codebooks about the idea of ‘open code books’ allowing for multiple ethnographers to share the same reference hierarchies

I’m sure others in this thread may be able to point to other examples of the evolution of this discussion

melancon · September 7, 2018, 1:47pm

I see. That must be in link with discussions we had in the context of opencare (I think I remember discussions of this sort when we were at CERN in Geneva …). The open notebook post by @alberto however is already addressing technical/methodological issues while I thought colleagues here would address more “philosophical” issues. Personally, I am curious to here our experts about how they foresee the influence coders may have on each other, affecting how coders individually code and as a consequence how it may affect the overall coding. I am not sure this makes sense …

alberto · September 7, 2018, 2:12pm

Trust @melancon to zero in onto The Problem. The Problem is that ethnography is very far indeed from being a collaborative discipline.

As @alex_levene mentions, @amelia and I are now talking of something called an open codebook. This maps perfectly onto my idea of a “coding wiki”, with the great advantage that ethnographers relate to it: they know what a codebook is. The paper codebook serves as a metaphor for my wiki idea, just as the paper address book we used to have pre-computers serves as a metaphor for the piece of software called “address book” or similar.

The difficulty with that stuff is the obviously non-open culture of the discipline:

I found exactly the same thing among archaeologists. They also have a problem of “what exactly constitutes evidence”; they also have developed their own private repositories of knowledge that precede the writing of a paper (“field notes”, “gray literature”, etc.). And they also are very private about them and never share them.

johncoate · September 7, 2018, 2:48pm

Regarding getting paid:

The community managers for our 2 Horizon 2020 projects will get paid something, I’m not sure of the exact number, but it won’t be much, at least ‘per hour.’ But they will come from close association with the partners, so should be motivated to do that part of the work where you further the conversation. That part of the work takes longer than the ‘keeper of the rules’ functions because you have to really think about what the others are saying or trying to say, and the context from which they say it.

johncoate · September 7, 2018, 3:34pm

as a computer scientist helping you guys to deal with and make sense of the data, one concern I have is >to identify approaches that can help ER’s methodology to scale.

Where does machine learning come into the picture, if it does at all? @hugi and I have discussed this a few times. I used it back in the 90s to analyze multiple news sources so I could combine them onto single pages based on subject (for which we devised a categorizing system). Back then it was all very proprietary, but nominal searching today shows that there are some possibly interesting open source products such as RapidMiner with the Aylien machine learning extension that evaluate text and graph it, including clustering graphs.

Do codes ever get “settled” enough to trust augmentation of this sort, or of some sort? Is this kind of augmentation too crude by our high standards?

Because it does scale. These products were created by large multinationals demanding so much from the “document delivery” community (“super searchers”) that they needed help in their attempts to categorize and cross references thousands of pages of documents once they all got digitized. Our using this to analyze news was unique at the time, which caused one of the big companies in the field, Autonomy, to work with us gratis, or nearly gratis, so they could enlarge the scope of their product. It was pioneering stuff at the time.

So in order to deal with how labor intensive document analysis is, I wonder if we can go from one super ethnographer (Amelia) to a larger community (taught to use OE via the Academy), to a relatively stable code set, to something helped by machine augmentation. Is this realistic? Or, does it break over each data set (such as a set of conversations about populism vs. a set about the future of the internet) needing too many specific codes? Or does it break because of something else?

melancon · September 7, 2018, 6:31pm

Ah ah! I did not mention anything – on purpose – about using machine learning or similar approaches. We made some very naive attempts on the occasion of opencare Bordeaux Masters of Network hackathon – and it did not show anything valuable. That being said, I sincerely think we simply did not try it seriously enough.
In any case, if I would be using any machine learning stuff, that would only as suggestion to a human coder. Things we thought would be to maybe rank posts to indicate those that maybe should be addressed in priority. Another would be to suggest codes based on (content/codes/topology features) similarities.
I still have hopes to spend time trying these out – maybe on the occasion of a future EU project?