Featured on nav4ai

We use privacy-friendly analytics cookies and ad conversion measurement to understand what helps teachers find Grade Coach. Cookie details

All posts

ib

How to grade IB MYP Criterion B writing faster (without losing the criteria)

A teacher’s guide to marking MYP Language and Literature Criterion B (Organising) consistently across 30 papers. The strand structure, the band descriptors, and where AI helps without becoming the teacher.

EnzoEnzo · May 19, 2026 · 10 min read
An MYP rubric card with the four criteria labelled, with Criterion B highlighted in indigo, sitting beside a stack of marked student essays in an international-school classroom illustration

I taught MYP Language and Literature in two countries before I taught it well. The first year, I was the teacher who graded papers on Sunday at 11pm with the criteria document open on one screen and the essay on the other, scrolling between them every few minutes to check whether the descriptor I half-remembered was actually what the guide said. The second year was much better. I had been through one round of MYP moderation and had finally internalised the thing that makes MYP grading different from any other system I have used: it is not point-based, it is criterion-based, and once you understand the difference the marking gets faster.

This post is for the teacher in their first or second year of MYP, especially in an international school where the moderation is real and your rubric needs to hold up under outside review. Specifically about Criterion B (Organising), because that is the one I most often saw graded inconsistently across teachers in the same department.

Why MYP grading is harder than it looks

Most school grading systems count points. You get five questions, each worth four points, you score 17/20 if you got 17 points worth of stuff right. MYP does not work that way. MYP gives you four equally-weighted criteria, each scored on an 8-point scale that you arrive at by best-fit judgement, not by adding up sub-scores. IB’s own framework confirms the four criteria for Language and Literature: A. Analysing, B. Organising, C. Producing Text, and D. Using Language, each scored 0-8 with descriptor bands at 1-2, 3-4, 5-6, and 7-8.

That sounds clean until you try to grade thirty papers on a Friday night. The difficulty is that "best-fit" requires the teacher to hold every band descriptor in their head simultaneously, decide which one is the closest match for the student’s whole performance on that criterion, and stay consistent across thirty different students. The system relies entirely on the teacher’s internal calibration. Two teachers in the same department, marking the same paper, can land on different bands and both be defensible.

That is the consistency problem MYP teachers actually live with. Software cannot solve it for you, but understanding the structure of the criterion can.

What Criterion B (Organising) is actually measuring

In MYP Language and Literature, Criterion B asks one question, in three pieces. Can the student arrange ideas in a way a reader can follow, using the conventions of the genre, with the formatting and referencing the task requires? That is it. The IB guide breaks it into three strands, paraphrased here:

  1. Strand i: Organisational structures. Does the student employ structures (paragraphs, sections, headings, transitions) that serve the context and intention of the task? An essay needs essay shape. A speech needs speech shape. A letter needs letter shape.
  2. Strand ii: Coherent and logical sequencing. Are opinions and ideas ordered in a way that builds, rather than rambles? Does each paragraph develop a single idea before moving on? Does the conclusion do work, or just restate?
  3. Strand iii: Referencing and formatting. Does the student use referencing and formatting tools (citations, headings, paragraphing conventions, quotation marks) appropriate to the task? This is the strand that catches whether a student knows how a research paper looks different from a personal narrative.

The mistake I made in my first year was thinking Criterion B was "did they use paragraphs." That is one indicator of one strand. The whole criterion is about whether the writing is shaped for its purpose, all the way down to the formatting. Teachers who only look at strand ii systematically under-score strand iii, and vice versa.

The descriptor bands, in plain English

IB’s official descriptors are deliberately abstract; the language is general so it can apply across years 1 to 5 of the programme. That is helpful for the framework and unhelpful for grading thirty papers fast. Here is how I translate the bands when I am actually marking. The wording is mine, the structure is the IB’s.

  • 1-2 (limited). The reader is working hard to follow the writing. Paragraphs are absent, run together, or contain unrelated ideas. Sequencing of ideas is unclear. Referencing and formatting are minimal or inappropriate to the task.
  • 3-4 (adequate). The reader can mostly follow the writing. Paragraphs exist but are uneven; some develop one idea fully, others jump between points. Sequencing is generally logical with occasional lapses. Referencing and formatting appear but inconsistently.
  • 5-6 (substantial). The reader can follow the writing easily. Paragraphs do their job; each develops a single idea, transitions are present. Sequencing of opinions and ideas is logical and largely sustained. Referencing and formatting are appropriate to the task and used consistently.
  • 7-8 (excellent). The reader is unaware of organisation; it attracts no attention because it is done so well. The writing develops in a way that builds the argument. Referencing and formatting are not just appropriate, they are deployed with purpose.

That last band, "the reader is unaware of organisation," is the IELTS Band-9 phrase for cohesion (British Council, 2023). I borrowed it because IB’s descriptor for the 7-8 band is harder to hold in your head while marking, and the IELTS phrase captures the same observation. Substitute whichever is easier for you.

A diagram showing IB MYP Criterion B with its three strands (organisational structures, coherent sequencing, referencing and formatting) connected by arrows to a single 0-8 best-fit score
Three strands, one 0-8 score. The best-fit judgement is what makes MYP marking different from a points-out-of-twenty rubric.

The four-step marking flow that saved me about two hours per stack

Here is the workflow I converged on. It is not magic, it is just doing the same four things in the same order on every paper. That alone made my grading three times faster and noticeably more consistent across the batch.

  1. Read the whole paper once, no marking. This is the step the assessment literature is unanimous on and the step every tired teacher skips. Read it once for impression. Notice when you get lost, when you are surprised, when you stop following. That impression IS your best-fit data.
  2. Score Criterion B before the others. Organising is the criterion most affected by reading order; if you have already marked grammar errors all over the paper, you cannot un-see them when you go to assess organisation. Mark organisation first, when your eye is still on shape.
  3. Mark all three strands at once, but record one band. For each of the three strands ask: limited, adequate, substantial, or excellent? Most papers will sit in the same band on all three strands. The ones that do not are the ones worth a comment. The single band you record is the best-fit across all three.
  4. Comment on one strand, not three. If you are giving the student a 5-6 because their paragraph structure is solid but referencing is inconsistent, the comment is about referencing. Specific, targeted, fixable.

That is the entire flow. The biggest speed gain comes from step 2: scoring B before A, C, and D on each paper. Criterion B is the one most easily polluted by your view of the rest of the paper. Score it cold.

Where AI helps, where it does not

I want to be specific about this because MYP is not a context where you can take shortcuts. The IB Diploma Programme assessment is externally moderated; MYP is internally moderated, but if your school is up for evaluation the marking samples are scrutinised. AI cannot do MYP grading for you.

What AI can do is execute a rubric you have already calibrated. If you have done the four steps above on five papers yourself, you have established what a 5-6 looks like in your hand for this batch. A locked-rubric grading tool can then carry that calibration through the remaining twenty-five papers consistently. The AI is not replacing your judgement, it is extending it across the rest of the stack.

That is the only frame in which I would use AI to help with MYP. Calibrate first. Lock the rubric. Let the tool execute. Review every score before submitting; override what needs overriding. The locked-rubric question is the same one I covered in the locked-rubrics post: if the rubric is being re-derived per paper, you have built drift into your moderation sample, which is the worst possible thing for an internationally-moderated programme.

What MYP teachers should ask any AI grading tool they try

  1. Can it use my school’s actual MYP rubric? Not a generic essay rubric. The MYP descriptors I have actually trained my class to recognize.
  2. Does it score per-criterion, or per-criterion-per-strand? The per-criterion answer is fine for marking. The per-strand answer is fine for student feedback. Make sure you know which you are getting.
  3. Are scores edited by me tracked separately from AI scores? When your moderation sample shows your distribution, the moderator should be able to see which scores were AI-original and which were teacher-overridden.
  4. Can I export the rubric and the scores in a format my school’s gradebook accepts? Most schools using ManageBac, Toddle, or PowerSchool want CSV. Make sure the tool produces that.

Those four questions filter most general-purpose AI grading tools out of the running. The tools that remain, you can trust to extend your judgement instead of replacing it.

The bigger picture

MYP grading is hard because it is honest. The rubric describes what good writing looks like, and asks the teacher to make a judgement, and trusts the teacher to make it consistently. The sources I leaned on for the structural facts above are AssessPrep’s 2026 MYP assessment guide and the IB’s public Language and Literature framework page. Both confirm the structure I describe. The interpretive work, the four-step flow, the band translations, that is mine, refined over a decade of teaching it.

If you take one thing from this post, take the second step. Score Criterion B first, before anything else, when your eyes are still on shape. That single change saved me more time than any tool I ever used.

Sources cited above: International Baccalaureate Organization, MYP Language and Literature framework page; AssessPrep, "IB MYP Assessment: The Complete Teacher’s Guide" (April 2026); British Council/IDP/Cambridge, IELTS Writing Band Descriptors public version (2023). The MYP descriptor band wording above is a paraphrase; the exact text lives in the official Language and Literature subject guide on IB’s Programme Resource Centre, which you should consult directly when calibrating with your department.