POPREBEL Coding Thread

@alberto, any ideas? We are all having trouble.

@Wojt, if you feel comfortable, please upload a photo of yourself so your avatar isn’t blank :slight_smile:

Will do!

Hmmm…am I doing something wrong?

Dear @amelia, last week I was not able to do much coding while being exhausted from sitting at the state exams committees. (Somehow I though it would be much easier, and I will be more fit for coding.) Therefore, I would like to code this whole week, mainly on Thursday and Friday (2nd and 3rd July) when I have time and my daughter is absent (she will be at her grandma’s place) which leaves more mental and time capacities for the work. Would it be OK to postpone the deadline a bit (two days)?

Not at all. Just log out of Edgeryders, when you log in again your account syncs to the new photo you uploaded on communities. It’s a necessary evil to have single sign-on for all communities in the Edgeryders galaxy.

Thank you!

1 Like

… and it works beautifully!

1 Like

It seems everyone is running a bit behind, so let’s say that coding needs to be finished by the end of the day on the 6th. That gives us two full days to go through each other’s codebooks before we meet.

There is no purpose in us meeting if we have not reviewed each other’s codebooks, so please give an update on the 6th if this is not looking feasible for anyone and we will go from there. If this ends up being the case, let us know sooner rather than later so we can push back.

@Jirka_Kocian @Jan @Wojt

@amelia @Jirka_Kocian @matthias @SZdenek @Wojt @Richard Dear All, as I wrote to Amelia earlier, Wojtek and I had a really great “workshop” on Friday (June 19) to develop our method of work and code a bit. We have come up with three questions:

The mother-child code relationship. Say, we code “homosexuality” and it intersects in the coded fragment with the issue of “various approaches”. Wojtek codes separately assuming that if we want to get the intersection the software will catch it and display. I code: mother: “homosexuality” and child: “homosexuality: various approaches.” What is your advice?

Wojtek observed that if, say, he codes the whole entry with CODE1 and then codes a part of it with CODE2 and then wants to adde CODE3, say, to a word, he cannot do this. The system allows only two codes “on one bit of text”, it seems.

The most urgent issue: is it possible to duplicate a threat of a conversation, so we could code it on the platform separately (also inter-coder reliability) and only then to compare our codes and reconcile them? It is - we believe - a very useful exercise, at least the beginning, to compare and try to synchronize our coding “habits.”

Yours, Jan

1 Like

If this is just an exercise, you can copy/paste the content and create a new thread of your own as a “test”. I’d recommend @Jan being the test subject, since he assigns less codes, so it’ll be easier to transpose them onto the original thread if you want to merge them. Does this work for you?

Does @Wojt create a hierarchy in the backend, or just co-code? If the latter, that’s a good approach that would ideally be combined with yours as well. Yours works, but only if you assign parent-child codes that makes sense from an SSNA point of view. First, the top-level code will not auto-assign to the ones below it. @hugi and our team on a different project tried having this happen automatically, but it lead to a giant overproliferation of codes that rendered the SSNA meaningless. Second, “various approaches” is too vague on its own (to be honest, I don’t think the code “homosexuality:various approaches” is very descriptive either, so ideally the “lower level” code would be a more specific thing (e.g. “lesbian relationships”).

The plan (as articulated in this post, which you should both read carefully) with hierarchies is as follows:

So, in short – apply both top-level code and lower-level code if it applies, but be discerning. Use hierarchies in your own codebook, and we will then apply them together in the backend and use them to refine codes and apply top-level codes more widely if we need to – it’s very easy using the “copy” function in the backend to apply a top-level code to all instances of a lower-level code, but doing so automatically leads to too much vagueness/heavy co-occurrences on too high-level codes. If we do so more deliberately, in tandem with the SSNA creation, we get a more refined and accurate picture of the data. (as I describe this, @alberto, this might be a methodological point for us to note).

Just right click outside of the selection and you’ll be able to add more codes than 2 (we’ve discussed this on multiple occasions in the POPREBEL team and I go over it in training, but perhaps we need a more centralised documentation to refresh memories as it’s a lot of information to keep track of).

@matthias, feel free to check me on these answers :slight_smile:

I have a call with @Wojt on Wednesday or Thursday (time pending) that if you can also join, would be useful to have you.

@wojt, so we pick a time more amenable to @Jan, should we say 9am EST, 2pm UK, 3PM Poland? I prefer Thursday but if Wednesday works better for @jan, I can make that work.

@amelia I’m fine with either Wednesday or Thursday, time of your choosing.
As to “various approaches” and “homosexuality: various approaches” we were referring to our codebooks only. Neither I or @Jan build hierarchies in the backend.
The question as I saw it, was more about how to highlight the relationships between various themes/codes/labels so that the system (which is at least partly automated as far as I understand, I mean the generation of the network and the graph) can see not only co-occurences, but also meaningful relationships between various codes.
This particular example pertained to one respondent noting that people exhibit various attitudes towards homosexuality,and now I’m wondering if it’s worth coding at all… :slight_smile:
But let’s take a different, perhaps better example: which is better in the codebook and in the codes on the platform: the code “discrimination” (with “homosexuality” as its parent code in the codebook - a choice that, to my mind, would look better on the graph) or “homosexuality: discrimination”, provided that they pertain to the same fragments within the same single post? In our codebooks it probably doesn’t matter that much, since if we take care to somehow show the hierarchical structure it should be fine to later reflect that in the backend. Am I correct?

Great news!

As per the layers of coding, that is great news again! I probably won’t use it much myself, but may come in handy.

Thank you for all your help!

1 Like

I am no ethnographer, but would argue that neither solution makes the most of our data model.

1 . Beware false hierarchies. Discrimination is definitely not a child of homosexuality. Hierarchies are meant in a proper ontological sense: cat is a child of mammal, and France is a child of Europe.
2. Double codes are bad practice. homosexuality:discrimination destroys the information that two separate experiences are mentioned in the same post. In SSNA, you are supposed to enter two separate codes, homosexuality and discrimination. The connection between the two is meant to be emergent: if (and only if) many posts mention both experiences, the semantic graph will show a strong connection. Formally, connections between experiences or concepts are represented by edges, not collapsed into nodes.

The advantage of coding thus is large. Somewhere else in the conversation you could have discrimination associated to something else, say age. The ego network of discrimination will then show you the various aspects of discrimination in the corpus. With double coding, you have to fall back on human memory to rebuild it. “Wait, did I not see discrimination elsewhere?” With large corpora and several people coding, this problem becomes more severe. SSNA’s main advantage is scalability, so it is essential to code for that. Your codes are no longer a memory aid for you, but a memory aid for the collective effort of several researchers plus a computer system.

@amelia, what’s your take here?

1 Like

Thank you, @alberto!

My pleasure, @Wojt .

There is an additional point that I did not mention: turns out that rich posts are… rich, with 10-20 annotations. Picture them as points in the highly multidimensional space of human experience, for example a reported episode of workplace discrimination based on sexual orientation in Warsaw. You have two choices:

  • either collapse them into a n–multiple code, like homosexuality:discrimination:workplace:large_cities:poland
  • or attribute each of the n codes separately.

The latter solution will show up as an n-clique in the semantic graph, with all codes connected to each other. This is how the SSN knows that they all occurred together. You do not lose that information. At the same time you are protected from having to carry the logic of multiple codes to a rather silly conclusion. And you can reach the post from any of the codes: “let’s see what happens in the part of the convo that describes Poland”, or “hmm, that about the LGBT+ community?”

SSNs are quite elegant objects, you see. As you get the knack of them, you will find yourself picturing drawing edges as you code: “wow, I have seen discrimination a few posts back, but I don’t think it ever co-occurred with homosexuality yet!”. That sort of stuff.


Heads up, everyone: moving this out to the public workspace. There seems to be no reason for keeping it secret, in fact it is super-interesting methodological debate!


@alberto is right (you have learned well, young padawan.) @Wojt and I just had a call to clarify.

I’m making a wiki now so that we have a centralised list of coding conventions in POPREBEL (and in general, an updated centralised list of best practices for coding in SSNA). Will link when it’s finished.

1 Like

I was going to suggest that!

1 Like

Hi! @amelia
Yesterday Jan and I had a meeting and we came up with a few questions.

  1. Say, we have a code stigmatisation, an example of which may be labelling, and we encounter a text where a set of labels is listed. So a person mentions that people call her a feminazi, a dike, etc. Is it ok to code every label? I know we should rather avoid coding word-for-word, but some of them, like Polish “pisior” (a Law and Justice hardcore supportrer) wil keep popping up and we think are worth coding.

  2. Jan came up with the idea to introduce the code constructing_common_identity with children such as: nation, religious community, Europeans, etc. I have already noticed the issues of common identity being raised here and there, people speaking of the need for finding, (re) building it, complaining of it being dismantled, so I think it might be worthwhile to have it.

  3. Matters related to inclusion/exclusion dychotomy seem to play a central role in our investigations. Have you guys come across any mentions of these two? How do you code them?

  4. We have a story about living libraries, and we’d like to code them under countering_exclusion -> learning/teaching pluralism -> living libraries. Does that sound ok?

  5. Jan and I think that having two codes positive_sentiment and negative_sentiment to code for positive feelings/attitude and negative feelings/attitude towards something respectively might be of use. Do you code for emotions? Does it make sense at all to put these into backend (I have my doubts since their proliferation may distort the SSNA graphical output, yet they seem to be ethnographically valuable). Please advise.

  6. Polishness is described using various labels (provincial, religious, tolerant, etc.). How can we code a phenomenon that is being defined by means of listing its characteristics.

  7. One last thing. When I put my codes in the backend they sometimes merge automatically with already pre-existing codes by other annotators, sometimes they don’t (for example my homophobia, or country_comparison are now under Amelia’s, while my code church stays separate). It’s not a problem now, but may somehow make adding tranlsations more difficult. Will see. Just saying :slight_smile:

Ok, that’s it for now. I’m getting back to coding, I have another meeting with Jan scheduled for today. I keep working on refining my codebook and writing definitions for the codes I decided to keep.

@jan did I put it right?

Have a great day, everybody!