AI – The Gino Case

AI Artifacts

This page collects AI artifacts that have been produced from the transcripts and other material.

For the written artifacts, we’ve corrected any errors. For the audio and video artifacts, we have not tried to edit the content. We have instead provided a transcript with any errors corrected. Both the audio and video content are compelling, though the videos can get a bit weird at times. We offer the content, weirdness and all, believing that weirdness notwithstanding, it conveys an accurate understanding of the facts of the case.

The primary engine we’ve used to create these artifacts is Google’s NotebookLM. NotebookLM permits you to upload sources and then generate video artifacts, podcast artifacts, or slide shows as artifacts. If you follow the NotebookLM link below, you can interrogate the source material, or view other artifacts as they’re created.

If you find this useful, please share the page. (If you click here, it will give you a draft tweet you can share on X.)

Written Summaries

01 / Read

This section includes AI-generated written summaries of the case, with errors corrected. Start with the 50-word bullet, then expand whichever summary fits your time. (And vote to indicate which you found most helpful!)

In 50 words

When Harvard learned of anomalies in Francesca Gino’s data, it had to investigate. The podcast argues it crossed the line: treating uncertainty as proof, limiting her defense, failing to preserve evidence, shifting theories, and revoking her tenure without clear and convincing evidence that she committed fraud.

Listen before you judge.

Short summary

The Anatomy of an Injustice

~5 min read · click to expand

Why Harvard’s Judgment Against Francesca Gino Cannot Stand

To justify this career-ending penalty, Harvard’s own rules required proof of intentional manipulation by Prof. Gino herself by “clear and convincing evidence”—a rigorous legal standard demanding a highly probable, almost unshakable certainty of guilt.

But a careful examination of the 2,500-page evidentiary record reveals a chilling reality. Harvard did not uncover a fraudster. Instead, driven by institutional panic, it orchestrated a structurally rigged prosecution. It replaced established faculty rules with new protocols, crippled Prof. Gino’s right to a defense, destroyed exculpatory evidence, ignored evidence that some of the data anomalies actually weakened or were irrelevant to the papers’ conclusions, prosecuted time-barred charges, and willfully ignored physical proof of her innocence.

Harvard’s judgment is not a triumph of academic integrity; it is a catastrophic failure of due process.

A Process Designed to Disable a Defense

When the data-anomaly blog Data Colada alerted HBS to possible irregularities in Prof. Gino’s work, the HBS Dean bypassed the faculty’s standard, two-page misconduct protocols. In secret, without faculty consultation and approval, his office drafted a draconian, 16-page procedure and weaponized it against Prof. Gino alone.

The Gag Order and Expert Ban: Under threat of termination, Prof. Gino was placed under a strict gag order. Crucially, while Harvard hired an expensive forensic firm (Maidstone) to build the prosecution, it limited Prof. Gino to two advisors. Once these were used up, she could not hire her own forensic expert to interpret the complex data. Harvard gave itself technical capacity while intentionally blindfolding the accused.

The Spoliation of Evidence: Harvard’s Research Integrity Officer promised Prof. Gino in writing that a “forensic copy” of her computer had been taken. This was a devastating falsehood. An IT technician merely copied selected files, allowing the computer’s automated operating-system logs to be permanently overwritten. Harvard destroyed the digital metadata that could have definitively proven which files Prof. Gino opened and when, and then HBS blamed her for the lack of evidence.

The Bait-and-Switch: Once Harvard formally declared her guilty, Prof. Gino was finally allowed to retain experts for her appeal. Her experts so thoroughly demolished the Maidstone report that Harvard quietly abandoned it. But instead of reconsidering her guilt, Harvard commissioned a new 230-page report from a different expert, introduced entirely new “falsification scenarios,” and gave Prof. Gino barely 30 days to respond. The appellate Hearing Committee effectively held a brand-new trial.

Prosecuting Time-Barred Claims: Due process dictates that individuals cannot be prosecuted for ancient claims because often old evidence is lost or unavailable. Harvard’s rules explicitly state that research older than six years “cannot be investigated.” Yet three of the four papers in question are now 10, 11, and 14 years old. Harvard bypassed this commonsense limit by claiming Prof. Gino’s routine string-citations of the papers “renewed” the misconduct—an absurd, bad-faith interpretation that renders the statute of limitations meaningless for any active academic.

The Collapse of the Substantive Case

Across all four allegations, Harvard’s substantive case relied on two massive, unproven assumptions.

First, the Hearing Committee claimed that all data anomalies strengthened Gino’s hypotheses, thereby proving a motive for intentional manipulation. This is demonstrably false. In every paper, numerous anomalies actively weakened the results or were entirely irrelevant. A fraudster does not manipulate data to sabotage their own research.

Second, in empirical behavioral science, Research Assistants (RAs) are the ones who gather, merge, and clean datasets. Over her career, Prof. Gino employed dozens of RAs. Yet Harvard convicted her without meaningfully interviewing the people who actually processed the spreadsheets—interviewing only two of the twelve RAs involved in the four papers at issue—preferring the improbable assumption that a star professor demoted herself to RA to manually clean and alter raw data.

When scrutinized objectively, the allegations completely disintegrate. Even in the charges that were barred by Harvard’s own statute of limitations, Prof. Gino was able to document incontestably the error of Harvard’s charges. (The allegations are presented in reverse order from Allegation No. 4 to No. 1 to chronologically reflect the age of the papers at issue.)

Allegation 4 — The Physical Receipts (the 2012 paper)

Harvard compared “File A” (an early dataset) with “File B” (the analyzed file), asserting Prof. Gino fraudulently altered File A to create File B. However, while unpacking old boxes of work documents, Prof. Gino discovered the original physical paper payment receipts for the participants. The receipts match File B. File A was missing participants, proving it was merely an incomplete working draft. Astonishingly, the Hearing Committee falsely asserted in its final report that Prof. Gino “did not provide those receipts,” literally ignoring physical evidence explicitly logged in their own record (Exhibits RX 626A and 626B).

Allegation 3 — The Mechanical Turk Cheaters (the 2014 paper)

Harvard alleged Prof. Gino fraudulently marked 12 honest participants as “cheaters” to boost her results. However, the dataset contained two columns: one for lying on a predicting a coin-flip, and one for a broader “cheated” judgment. These 12 participants were gig-workers from Amazon Mechanical Turk who achieved perfect scores on highly Googleable word puzzles, took significantly longer to finish the test, but scored averagely on un-Googleable tasks. This strongly indicates they Googled answers to protect their online ratings, which was against the rules. An RA cleaning the data would rightly mark them as rule-breakers. Harvard simply never asked the RAs if they did this. As for the other discrepancies the Harvard Committee identified, many weakened the results or were irrelevant to the hypothesis. And here again, though the data was worked on for over 500 days by multiple RAs, no RA was interviewed about their work with the data.

Allegation 2 — The Scammer and the Excel Error (the 2015 paper)

Harvard’s initial charge focused on 20 anomalous entries where students, supposedly undergrads at Harvard, were asked for their year in school and instead answered “Harvard.” Prof. Gino was able to show that those entries were not from Harvard students, but from survey scammers seeking to get paid for participating. After Harvard was forced to concede that 20 anomalous entries were actually the work of a gift-card scammer, the University was still so determined to find guilt that it changed tack and focused on 154 apparently altered cells in one spreadsheet. About half of the cited alterations were completely irrelevant to the paper’s hypothesis. Prof. Gino’s experts proved most of these alterations deterministically match a hidden Excel “Cut and Insert” function—triggered by an accidental shift-drag during routine RA data cleaning. The mathematical probability of Prof. Gino independently faking 154 cells (and making irrelevant changes in half of the cells) in a pattern that exactly mimics an automated Excel glitch is practically zero.

Allegation 1 — The SPSS Logs (the 2020 paper)

In the only timely charge out of four allegations, Harvard claimed Prof. Gino spent January 24, 2020, repeatedly tweaking data in the SPSS statistical software to “juice” her results. The actual SPSS system logs prove this never happened. Of 61 distinct commands run that day, only six were repeated, and none involved reloading the data to check adjustments. Furthermore, 39% of the anomalies had absolutely no bearing on the hypothesis. Finally, Prof. Gino’s RA explicitly testified that he had already cleaned the data prior to that exact date.

Conclusion: No Deference Without Due Process

When institutions demand judicial deference for internal tribunals, it is predicated on the assumption that they adhered to basic due process. Harvard did not.

Between error and malicious fraud, the vast majority of academic data anomalies are born of human error. Harvard converted unresolved data anomalies—stemming from routine RA error, gig-worker behavior, and spreadsheet data-processing errors—into personal culpability. It crippled its target’s defense, spoliated digital evidence, read away its own statute of limitations, neglected obvious witnesses, and blatantly misrepresented the physical record.

The University’s motto of Veritas does not grant it immunity from procedural fairness. Francesca Gino was not stripped of her tenure by clear and convincing evidence; she was stripped of it by bureaucratic predetermination, inertia once caught in its errors, and a catastrophic failure of institutional integrity. This judgment cannot be allowed to stand.

Three AI takes on the case — vote for the one you find most compelling

We asked three frontier AI systems to write a long-form summary of the same source material. Read each, then upvote your favorite. The counter at the top of each card shows how many readers picked that version.

Full summary · ~20 min

The Burden of Proof

By ChatGPT 5.5

By ChatGPT 5.5 · the case for why Harvard’s judgment cannot stand under its own evidentiary standard

The podcast’s central argument is not that Harvard was wrong to investigate Francesca Gino. It is that Harvard was wrong to convict her. Data anomalies in published research are serious. A university has an obligation to ask whether they reflect error, poor data management, misconduct by someone in the research chain, or intentional fraud by the principal investigator. But the podcast argues that Harvard moved from the existence of anomalies to the conclusion of fraud without doing the work required to justify that leap. Its thesis is narrower, and for that reason stronger: whatever one thinks about the anomalies, Harvard did not have clear and convincing evidence that Gino intentionally manipulated data, and the process by which it reached that judgment was so compromised that the judgment cannot fairly stand.

The story begins with Data Colada, a group the podcast treats with respect rather than contempt. They identified apparent irregularities in four papers connected to Gino and brought those concerns to Harvard Business School. The podcast does not deny that the anomalies were real enough to warrant inquiry. Nor does it attack the premise of post-publication review. The point is that Data Colada’s role was to raise a question, not answer it. Once Harvard received the complaint, the institution had to determine whether the anomalies could be tied, by strong evidence, to intentional action by Gino herself. That was the question. The podcast argues Harvard never answered it. It substituted suspicion for proof, institutional confidence for careful fact-finding, and an adversarial posture for a search for truth.

The importance of the burden of proof runs through the entire season. Harvard was not deciding whether Gino’s papers contained errors. It was deciding whether to impose the most severe academic punishment available: the revocation of tenure, apparently for the first time in Harvard’s history. The applicable standard, as the podcast presents it, was clear and convincing evidence of grave misconduct or neglect of duty. That standard is higher than “more likely than not.” It requires a firm conviction, not merely a plausible narrative. Lessig’s argument is that Harvard’s evidence fails even a lesser standard, but the legal and moral point is that Harvard’s own rules required more. The committee had to be able to say not simply, “these data look wrong,” but “the evidence clearly and convincingly shows that Gino intentionally made them wrong.”

That burden matters because the field at issue is not one in which a single author writes every word, collects every datapoint, and personally touches every spreadsheet. The podcast spends time explaining behavioral-science research practice precisely because the case can otherwise look simple from the outside. Gino had published roughly 140 papers involving more than 500 studies, many co-authors, and many research assistants. In this field, RAs often collect, clean, merge, and prepare data before a professor analyzes it or drafts a paper. Cleaning data can mean correcting obvious mistakes, excluding test participants, dealing with survey scammers, coding free-form responses, or reconciling files created at different stages. None of that proves innocence. But it makes attribution difficult. A data anomaly does not identify its author. In a research environment where many people handle data, the question “who did this?” cannot be answered by asking who was the most famous person on the paper.

One reason the podcast lingers over the gag orders is that public perception formed before the defense could be heard. Lessig opens by acknowledging his own position: he is Gino’s friend, later helped her pro bono, and does not pretend to be neutral. But he also explains why a one-sided public record matters. Major articles and online commentary largely encountered the prosecution’s theory without Gino’s detailed answer, because Harvard’s rules prevented her from giving that answer while the processes were pending. The podcast therefore presents itself as a corrective, not a detached documentary: an effort to put into the public record the defense that could not be offered when reputations were being fixed. That framing is important. It does not ask listeners to trust friendship. It asks them to test confidence against the actual evidence, slowly, allegation by allegation.

The podcast’s procedural critique starts there. According to the account, HBS did not simply apply its ordinary faculty-adopted misconduct rules. After Data Colada contacted the school, HBS created a new, far more elaborate procedure, without faculty approval, and applied it to Gino. That procedure imposed two constraints that shaped everything that followed. First, it gagged her. She could not discuss the charges outside the narrow channels Harvard permitted. Second, it limited her to two advisors. She chose a lawyer and a trusted HBS colleague, neither of whom was a forensic data expert. Later, when HBS hired Maidstone, a forensic firm, to analyze the data, Gino wanted her own forensic expert. According to the podcast, she was told she could not hire one because she had already used her two advisor slots. Harvard thus gave itself the technical capacity to build a case while denying the accused the technical capacity to test it.

That asymmetry is one of the most important points in the season. The charges were not simple accusations about a witnessed act. They depended on complex inferences from files, timestamps, datasets, statistical patterns, and software behavior. A forensic report could look authoritative to a faculty committee even if its assumptions were wrong. Without an expert, Gino could not meaningfully test the report before it was used to condemn her. The podcast argues this was not a harmless procedural defect. It became central. Once Gino was finally allowed to retain forensic experts during the later tenure-revocation process, they attacked the Maidstone analysis so effectively that Harvard effectively stopped relying on it. But by then, HBS had already judged her guilty, placed her on leave, notified journals, and set in motion the process to revoke her tenure. The error had already hardened into institutional commitment.

A second procedural failure concerns evidence preservation. When Gino was first notified, she brought her HBS-issued devices to campus. The podcast says Harvard’s Research Integrity Officer represented that forensic copies of her laptops’ hard drive had been made. But, as the account presents it, no true forensic image was taken. Instead, selected files were copied. That matters because the missing metadata and system logs might have shown which files were opened, from what source, and when. In later allegations, those logs could have helped decide whether Gino worked from a raw download, from an RA-prepared file, or from some other source. Because the system was not preserved, automated logs were lost. The podcast treats this as devastating: Harvard failed to preserve evidence that could have answered central questions, and then used the absence of definitive evidence against Gino.

A third procedural failure is the failure to interview the obvious witnesses. If the ordinary practice in the field was that research assistants collected and cleaned the data, then the people who touched the data were indispensable witnesses. Yet the podcast says Harvard interviewed only two of twelve relevant RAs across the four papers, and in some allegations did not ask the RAs who had actually worked with the datasets about the key issues. This omission undercuts the causal chain Harvard needed. The issue was not whether anomalies existed. It was whether the anomalies were created by Gino intentionally. If RAs could have explained the anomalies as cleaning decisions, coding choices, spreadsheet mistakes, or responses to participant misconduct, then not interviewing them left the investigation structurally incomplete. Harvard could not fairly claim to have excluded innocent explanations it never seriously investigated.

A fourth procedural theme is the statute of limitations. Harvard’s rules, as described in the podcast, imposed a six-year limit: research misconduct more than six years old could not be investigated. Yet three of the four allegations concerned papers far older than six years. The podcast argues Harvard avoided that rule by treating later citations to the papers as renewing the alleged misconduct. That move, in the podcast’s telling, makes the limitation meaningless. Academics routinely cite their prior work in later papers. If a string citation restarts the clock, then old claims can always be revived against active scholars. The rule would protect no one. The podcast argues only one allegation, the 2020 paper, was timely. The rest should never have formed the basis for a tenure-stripping judgment.

The later Harvard hearing did not cure these defects, according to the podcast. It amplified them. After HBS made its determination, the university initiated a Third Statute proceeding. A seven-member faculty committee, advised by lawyers, considered a record of more than 2,500 pages. But when Gino’s experts damaged the Maidstone report, Harvard did not step back and reconsider whether the original finding had been infected. Instead, the podcast says Harvard shifted theories, commissioned new expert work, introduced new falsification scenarios, and required Gino to respond under severe time pressure. What should have been a review became, in effect, a new prosecution. Lessig’s charge is that once Harvard had publicly and institutionally committed itself to guilt, the process became less about determining truth and more about defending a prior institutional judgment.

Against that procedural background, the podcast turns to the four allegations. Its recurring point is that Harvard’s substantive case rested on two broad assumptions. The first was motive: the anomalies supposedly all strengthened the papers’ conclusions, so they must have reflected intentional manipulation. The podcast argues that premise is false. Across the allegations, many anomalies either weakened the paper’s reported results or were irrelevant to the hypothesis. That matters because Harvard’s motive theory depends on directional benefit. A fraudster may manipulate data to help a desired conclusion, but it is much harder to infer fraud from changes that do not help or that make the result worse. The second assumption was attribution: because Gino was a lead or prominent author, she must have made the changes. The podcast argues that this ignores the actual division of labor in behavioral science.

Allegation Four concerns a 2012 paper and, in the podcast’s telling, illustrates how Harvard converted an incomplete file-history problem into a fraud finding. Harvard compared an earlier dataset, File A, with a later analyzed dataset, File B, and treated the differences as evidence that Gino had altered the data to create stronger results. The podcast argues that this theory collapsed when Gino found original physical payment receipts for study participants. Those receipts matched File B, not File A. On that account, File A was not the pristine original that had been fraudulently altered. It was an incomplete working file missing participants. File B was the file consistent with the paper trail. If that is right, then the central inference is backwards: the later file was not a fabrication from the earlier file; the earlier file was incomplete.

The treatment of the receipts becomes, for the podcast, emblematic of the whole case. Physical receipts are not speculative metadata or ambiguous statistical patterns. They are concrete evidence. Yet the committee, according to the summary, stated that Gino had not provided them even though they were in the record. The podcast treats that as a serious misrepresentation or at least a serious failure of attention. The problem was not merely that Harvard weighed the receipts differently. It was that the decision allegedly ignored or misstated the existence of evidence that directly contradicted the fraud theory. That allegation was also time-barred under Harvard’s own limitation rule, making its use doubly troubling in the podcast’s account.

Allegation Three involves the 2014 “Evil Genius” paper, which studied whether dishonesty could affect creativity. Participants played a virtual coin-flip game in which the software could determine whether they lied about guessing correctly, and then they completed creativity tasks. Harvard focused partly on twelve participants it said had been wrongly coded as cheaters, allegedly to strengthen the study’s result. The podcast’s answer is that “cheating” in the dataset had more than one meaning. One column captured lying in the coin-flip task. Another reflected a broader judgment about rule-breaking in the study. The twelve participants were Amazon Mechanical Turk workers who, according to the podcast, had perfect scores on easily Googleable word puzzles, took longer to complete them, and performed averagely on non-Googleable tasks. The inference offered is that they likely used outside help, a violation of study rules, even if they did not lie in the coin-flip task.

That does not prove exactly who coded them or why. But that is the point. The podcast says the data for this paper were worked on by RAs over a period exceeding 500 days, yet Harvard did not ask those RAs about the coding decision. If an RA, seeing suspicious MTurk behavior, classified participants as rule-breakers, that could explain the coding without fraud by Gino. And again, the podcast argues that Harvard overstated directionality: not all discrepancies helped the paper. Some weakened the findings or did not matter. Thus, Allegation Three repeats the pattern: a real anomaly becomes suspicious; suspicion becomes motive; motive becomes attribution to Gino; but the intermediate evidentiary steps are missing.

Allegation Two concerns a 2015 paper and is among the most intricate parts of the podcast. Harvard first focused on entries in which participants, supposedly Harvard undergraduates, answered “Harvard” when asked for their year in school. That looked strange. But the podcast says Gino’s defense showed that those rows were not evidence that she fabricated responses. They were responses from survey scammers trying to get paid, not legitimate Harvard students. Once that explanation emerged, the podcast says Harvard shifted attention to 154 apparently altered cells in one spreadsheet. The new theory was that the alterations were intentional changes designed to improve the result.

The defense account, as summarized in the podcast, is that many of those cells were irrelevant to the hypothesis and that the pattern of changes matched a hidden Excel “Cut and Insert” behavior triggered by an accidental shift-drag during routine data cleaning, and a second Excel error. This is a technical point, but the implication is simple. If a spreadsheet operation can deterministically produce the pattern of cell changes, then the existence of that pattern is not evidence of a human deciding, cell by cell, to falsify results. It is evidence of a spreadsheet event. The podcast emphasizes that it would be extraordinarily unlikely for a person independently fabricating data to reproduce the exact pattern of a mechanical Excel glitch, especially while making many changes irrelevant to the research hypothesis. Here too, the theory of intentional fraud weakens once the mechanics of data handling are examined.

The short supplemental episode on Allegation Two responds to a timeline objection involving Thanksgiving 2014. The reason the supplement matters is not merely the date. It shows the podcast’s method: slow the claim down, test the inference, and ask whether the record actually supports the conclusion Harvard drew. The podcast argues that the timing point does not rescue Harvard’s theory. Rather than proving intentional alteration, the disputed sequence remains consistent with the broader defense account of ordinary file handling, spreadsheet error, and overconfident inference. The point is not that every fact is tidy. The point is that messy facts cannot be transformed into clear and convincing proof by ignoring the mess.

Allegation One is the only timely allegation and therefore carries special weight. It concerns a 2020 paper about networking and moral psychology. Harvard’s theory, as the podcast presents it, was that on January 24, 2020, Gino repeatedly altered data in SPSS, checked whether the changes improved results, and continued until the outcome favored the hypothesis. This is the closest Harvard came to a direct narrative of manipulation by Gino herself. But the podcast argues the system logs contradict that narrative. Of 61 distinct SPSS commands run that day, only six were repeated, and the pattern did not show repeated reloading of data after changes. One repeated command was apparently run back-to-back without an intervening reload, which is inconsistent with the story of iterative tampering followed by checking.

The source-file issue is equally important. The podcast says both forensic experts agreed that the file Gino analyzed on January 24 had been copied from another file. Her RA, Alex Rohe, testified that he had already prepared or at least partially cleaned data before that date and provided it to her in the ordinary way. If Gino was working from an RA-prepared file, then comparing her analyzed file to a raw Qualtrics download does not establish that she made the differences. It only establishes that the analyzed file differed from a raw file. The missing forensic image could have helped identify the source of the working file, perhaps resolving whether she used a thumb drive or downloaded raw data herself. Harvard’s failure to preserve that evidence matters most here, because this was the allegation that should have been tested most carefully.

Allegation One also repeats the motive problem. The podcast says 39 percent of the anomalies had no bearing on the hypothesis. If many changes did not affect the claimed result, then the inference that every change was part of a scheme to “juice” the outcome becomes less persuasive. The podcast’s conclusion on this allegation is forceful: once ordinary practice, expert agreement about the copied source file, RA testimony, and the SPSS logs are considered together, Harvard lacked a foundation for finding that Gino modified the data at all. At minimum, the evidence was nowhere near clear and convincing.

The cumulative argument is more powerful than any single episode. Harvard’s case, as reconstructed by the podcast, required the same fragile bridge in each allegation. First, identify a data anomaly. Second, assume the anomaly helped the paper. Third, assume that if it helped the paper, it was intentional. Fourth, assume that if it was intentional, Gino did it. The podcast attacks every step. Some anomalies were real, but real anomalies do not assign responsibility. Some changes may have helped, but many did not. Intentionality cannot be inferred from patterns equally or better explained by RA coding, participant misconduct, file incompleteness, or software behavior. And attribution cannot be inferred while bypassing the people who handled the data and destroying or failing to preserve the evidence that might have shown the path of the files.

The podcast is also careful to distinguish academic integrity from institutional self-protection. It does not argue that universities should ignore suspected fraud. It argues the opposite: because academic integrity matters, investigations must be fair, technically competent, and procedurally honest. A sloppy acquittal would be bad for science. But a sloppy conviction is worse, because it creates the appearance of accountability while evading the discipline of proof. In this account, Harvard’s failure was not merely legalistic. It was epistemic. The institution that claims Veritas as its motto treated uncertainty as something to be managed, not resolved.

The human dimension is present but not used as a substitute for evidence. Gino lost her position, income, reputation, consulting work, and the security of an academic career built over decades. The podcast includes her description of telling her children, watching clients disappear, and trying to keep working while under gag orders. But the argument does not finally rest on sympathy. It rests on the record. The harm matters because the sanction was severe; the sanction matters because the proof needed to be correspondingly strong. Harvard’s decision ended a career. The podcast asks whether the evidence was strong enough to justify that end. Its answer is no.

The closing argument is therefore not simply “Gino is innocent,” though Lessig plainly says he believes she is. It is that Harvard’s judgment cannot stand under the standard Harvard was bound to apply. The process disabled the defense at the moment the defense mattered most. The investigation failed to preserve key evidence, failed to interview key witnesses, relied on a forensic report later undermined, evaded a limitations rule, shifted theories, and treated ambiguous anomalies as proof of intentional fraud. On the substance, each allegation had alternative explanations that Harvard did not eliminate, and in several instances the record allegedly contradicted Harvard’s central claims.

The podcast ultimately asks the listener to resist the seductive simplicity of the public story. “Professor of dishonesty caught being dishonest” is an irresistible headline. It is also, the podcast argues, a dangerous substitute for adjudication. The case is not simple. It involves old files, weak data practices, RAs, survey scammers, MTurk workers, physical receipts, Excel glitches, SPSS logs, missing forensic images, and a university under reputational pressure. In that complexity, Harvard had to be more careful, not less. The argument of the podcast is that Harvard had reason to investigate Francesca Gino, but not enough reason to destroy her career. It had anomalies. It did not have clear and convincing evidence of fraud by her. And when the institution could no longer prove the difference between error and guilt, it chose guilt.

The resulting question is not whether Harvard faced an easy case. It is whether, facing a hard one, Harvard earned the confidence its judgment demands. On the podcast’s account, it did not, and the consequence was not an abstract procedural defect but a life-altering institutional error.

Full summary · ~20 min

The Anatomy of an Injustice

By Claude

By Claude · Why Harvard’s judgment against Francesca Gino cannot stand

In May 2025, Harvard University did something it had never done in its nearly four-hundred-year history. It revoked the tenure of a faculty member. Francesca Gino, a behavioral scientist at the Harvard Business School, was stripped of her career on charges of academic data fraud across four published papers. Many people read the headlines and moved on. The story seemed settled. A star professor caught cooking her numbers, an institution acting with rare decisiveness, science self-correcting at last.

The story is not settled. A careful examination of the 2,500-page evidentiary record shows something different. It shows an institution that panicked, drafted a new procedure to convict rather than to investigate, denied the accused the help she needed to defend herself, lost the digital evidence that would have exonerated her, prosecuted charges its own rules expressly forbade it from prosecuting, ignored physical proof in its own files, and then misstated the facts in its final report. The judgment cannot be allowed to stand on the merits, and it cannot be allowed to stand on the process. This is the case for why.

The Standard Harvard Was Required to Meet

Harvard’s own rules required the Hearing Committee to find guilt by clear and convincing evidence. That is a demanding standard. It is not the preponderance standard of an ordinary civil suit, where a fact finder must be 51 percent confident. It is not quite the beyond-a-reasonable-doubt standard of a criminal trial. Courts and scholars place it at roughly 70 to 85 percent confidence: a firm belief or conviction that the allegation is highly probable, absent serious or substantial doubt.

Everyone agrees that the four papers contained data anomalies. The question was never whether anomalies existed. It was whether those anomalies were the product of intentional manipulation by Professor Gino herself, proved to the firm conviction the standard demands. The standard mattered for a simple reason. Behavioral science research is produced by teams. Research assistants gather the data, transcribe it from paper or download it from Qualtrics, clean it, merge files, recode variables, prepare it for analysis. Over her career Professor Gino employed more than sixty research assistants. She did not clean her own data. No one disputes this. So when an anomaly appears in a dataset, the threshold question is who introduced it, and the only way to answer that question fairly is to investigate both possibilities. Harvard investigated one.

A Process Built to Convict

When Data Colada, a group of data scientists who police academic research, contacted Harvard Business School in July 2021 with concerns about four of Professor Gino’s papers, the Dean did something extraordinary. He bypassed the faculty’s existing two-page misconduct policy. Behind closed doors, without faculty consultation, his office drafted a new sixteen-page procedure. He never brought it to the faculty for debate. He never asked them to approve it. He simply applied it to Professor Gino.

That new procedure imposed two crippling restrictions. First, it placed her under a strict gag order, threatening termination if she discussed the charges with anyone other than two designated advisors. She was forbidden from speaking to her research assistants, her coauthors, or anyone else who might have helped her reconstruct what happened with the data. Second, it limited her support to those two advisors only. On the advice of the Research Integrity Officer, she chose a lawyer and a faculty colleague. Neither was a forensic data analyst. Meanwhile, Harvard hired Maidstone, an expensive forensic firm, to build the prosecution. The Business School thus had expert capacity to interpret complex datasets. Professor Gino did not. She was permitted no expert to evaluate the forensic evidence that would determine her fate.

The consequences of this asymmetry were not theoretical. Once Professor Gino was finally allowed to retain her own experts, after Harvard had already declared her guilty, those experts demolished the central claims of the Maidstone report. They demolished them so completely that Harvard quietly abandoned the report. The committee that had convicted her had relied on a document Harvard itself no longer stood behind. A fair process would have reconsidered guilt. Harvard did not reconsider. It commissioned a new 230-page report from a different expert, Professor Jeremy Freese, introduced entirely new theories it called “falsification scenarios,” and gave Professor Gino thirty days to respond to material she had not seen. Her legal team had spent over two million dollars and a year preparing a response to the original report. They were told to start over in a month. Discovery had closed. They could not even seek the evidence they needed to test the new charges.

There was a third procedural failing that compounded the first two. Harvard’s rules, tracking the federal Office of Research Integrity, expressly state that allegations of misconduct in research more than six years old “cannot be investigated.” Three of the four papers in this case were six, seven, and nine years old at the time HBS received the charges. The rule allowed an exception only when a scholar continued to rely on the allegedly fabricated data. In 2024 the federal rule was clarified to make this explicit: the exception applies only to citation of the specific portion of the research record alleged to have been fabricated. Professor Gino had not done that. She had simply listed and cited the older papers as any active scholar does. Yet Harvard read the exception so broadly that any citation, however general, was treated as a renewal of misconduct. The interpretation makes the limitations rule meaningless for any working academic. As Justice Jackson once put it, statutes of limitations exist to protect parties “from the burden of defending claims after the evidence has been lost, memories have faded and witnesses have disappeared.” Harvard had bound itself by such a rule. It read the rule away.

Two Failures That Run Through All Four Allegations

Before turning to the four allegations, two facts must be named. They appear in each one, and together they negate the inference of guilt across the entire case.

The first is motive. In every allegation, the Hearing Committee asserted that all of the data anomalies tended to strengthen the conclusions of the paper. That assertion was meant to establish motive: only Professor Gino, the author, would have an interest in strengthening her conclusions, so she must be the one who introduced the anomalies. The assertion is false in all four cases. In every paper, a substantial portion of the anomalies either weakened the results or had no bearing on the hypothesis at all. A fraudster does not waste her time manipulating data to sabotage her own conclusions. Mixed anomalies are the signature of error, not fraud.

The second is the failure to investigate. In each of the four allegations, the data preparation work was done by research assistants. Across the four papers, at least twelve research assistants worked on the relevant datasets, in one case for over five hundred days. Harvard interviewed two. It simply presumed the research assistants were not responsible and never asked. That is not investigation. It is assumption. When the question is whether an anomaly came from the professor or from the people who actually handled the data, you cannot answer the question by speaking to only one side.

Allegation 4: The Physical Receipts

The oldest of the charges concerned a 2010 study Professor Gino conducted at the University of North Carolina, just before she moved to Harvard. The study tested whether signing a pledge of honesty before a task would make participants more truthful when reporting their performance. Because the study ran on paper, the raw data exists only as transcribed survey forms. Harvard compared two files: an early dataset prepared by Professor Gino’s lab manager Jennifer Fink, which Lessig called File A, and the dataset that produced the published analysis, File B. The Committee concluded that the differences between them, all but one of which strengthened the paper’s results, proved Professor Gino had fraudulently altered the data.

The entire charge rests on the assumption that File A was the final, complete dataset given to Professor Gino. Nothing in the record establishes this. Jennifer Fink never testified that File A was final. The HBS data consultant could not confirm it. The file was called “Taxstudy,” not “Final.” And in an email predating File A, Fink had explicitly flagged that participants had filled out their forms in very strange ways and that work remained to be done. File A still contains that uncorrected data. It is plainly an interim file.

What clinches the point is a discovery so improbable it sounds invented. While preparing her defense, Professor Gino unpacked old boxes that had moved with her from North Carolina to Harvard and finally to her home garage. Inside one box she found the original paper payment receipts for the study’s participants. Every participant who had completed the study had been paid, and the payment had been recorded on a receipt. The receipts establish, conclusively, that File A is missing participants who actually took the survey. File B matches the receipts. File A does not. There is no fraud to explain because there is no baseline from which the data was changed. File A is simply incomplete.

Her lawyers presented this evidence to the Hearing Committee. The Committee’s final report stated that Professor Gino “did not, however, provide those receipts.” This was false. The receipts were in the record at exhibits RX 626A and RX 626B. The Committee either did not know its own record or chose to ignore it. Either way, the charge cannot survive contact with the document the Committee said did not exist.

There was a second part to Allegation 4. Harvard pointed to a small change in an early draft of the paper’s description of the experiment and alleged that Professor Gino had revised the description to cover up a flawed study design. The original description suggested that participants had been paid twice, in two different rooms. But that design would be logically impossible. It would put the outcome variable, lying about performance, before the treatment variable, the honesty pledge. The lab manager, who actually ran the study, testified that no one had been paid twice and that no money had ever been returned. The receipts confirm her account. The data structure, with a single payment column, confirms it. The only evidence pointing the other way is a draft document with an identical typo to another draft, which proves the second was copied from the first. One mistaken description, corrected. That is the entire foundation of the cover-up charge.

Allegation 3: The Mechanical Turk Cheaters

This charge concerned a 2014 paper, “Evil Genius: How Dishonesty Can Lead to Greater Creativity,” based on a study conducted in 2012. Participants were recruited through Amazon’s Mechanical Turk platform. They were asked to predict the outcome of a virtual coin flip, then report whether they had guessed correctly. The study measured who lied about having guessed right. Liars were classified as cheaters, and the question was whether cheaters scored higher on tests of creativity.

Harvard alleged that twelve participants who had honestly reported guessing the coin flip wrong were nonetheless marked as cheaters in the final dataset, and that this reclassification fraudulently strengthened the results. The argument assumes one thing about the data file: that the “reported guessed correctly” column and the “cheated” column were meant to record the same information. If they were redundant, then reclassifying someone as a cheater without changing the lie-about-coin-flip column would be tampering.

But the two columns were not redundant. If they had been, there would be no reason to have both. The presence of a second column suggests cheating was being assessed on more than the coin flip alone. Look at the twelve participants. They were Mechanical Turk workers, whose livelihoods depend on reputation scores. The RAT task they were given asks for a word linking three other words, a common test whose answers are easily Googled. The instructions told participants not to consult external help. These twelve achieved a mean RAT score of 11.2, compared to 7.7 for everyone else, with one hundred percent of them solving two of the hardest puzzles, ones that stumped over thirty percent of the broader group. On the Usage Task, which cannot be Googled, they were merely average. They took longer to complete the survey than other participants, consistent with someone alt-tabbing to a search engine. They scored low on a measure of rule-following. Every signal points the same way. A research assistant cleaning this data could reasonably conclude that these twelve had cheated by looking up answers, even though they had told the truth about the coin flip.

Whether that is in fact what the research assistant did, we do not know. Harvard never asked. The data for this paper was worked on by at least five research assistants over more than five hundred days. None was interviewed. The single question that could have resolved the charge, whether the cheating column was meant to capture a broader judgment, was never put to anyone who would have known.

The same pattern repeats in the paper’s two creativity coding tasks. With the Usage Task, the source coding data exists, and it shows where the research assistant copied data from the wrong rows or the wrong tab. The largest such error, affecting eighteen participants, weakens the paper’s conclusions. No fraudster manipulates data to make her results weaker. The error is plainly a research assistant’s mistake. The presence of weakening errors of this exact type undermines the inference that any other anomaly was fraud.

Allegation 2: The Scammer and the Excel Error

The 2015 paper at issue here studied feelings of inauthenticity. Data Colada flagged twenty entries in which participants, supposedly Harvard undergraduates, had answered the question “what year are you in school?” with the word “Harvard.” That answer makes no sense. The Business School Investigative Committee concluded Professor Gino must have fabricated the entries to bolster her results, perhaps even pocketing the Amazon gift cards along the way.

Once Professor Gino was allowed to hire forensic experts, the charge collapsed within weeks. The experts pulled the Qualtrics metadata, which Harvard’s investigators had never bothered to retrieve, and showed the twenty entries came from twenty different IP addresses, mostly in data centers in foreign countries, all submitted through an absurdly outdated browser, Firefox version 5 running on Windows XP in 2014. Professor Gino was a Mac user. The pattern was unmistakable: a single scammer using proxy servers and a form-filler to harvest gift cards. Even the Hearing Committee eventually conceded the twenty entries had nothing to do with her.

Harvard then made a second claim, that some participant data appearing in the publicly posted file was missing from the raw Qualtrics export. Nine months after the Business School had declared Professor Gino guilty on the strength of this claim, the same expert who made it issued a correction. There were no missing entries. Every record matched. The foundation of the original conviction had crumbled twice. Yet the prosecution continued.

Instead of dropping the charge, Harvard sent the paper to a new expert who focused on 154 cells that had changed between an early file and the analyzed file. The Committee declared that all 154 changes strengthened the hypothesis. That was false. At most eighty did. The other seventy-four were in columns the analysis never used. There was no motive to alter values that did nothing.

The pattern of the 154 changes was not random. The values appeared to have been shuffled between blocks of cells. Professor Gino’s experts discovered that an obscure Excel feature called Cut and Insert, triggered by a shift-drag of a selected range, swaps the source and destination blocks without any warning. A single accidental shift-drag of a 9-by-8 block of cells would account for 94 percent of the observed changes. Two additional inadvertent shortcuts account for the rest. The mechanism is deterministic and replicable. The probability that an evil professor randomly tinkering with cells in a 10,311-cell spreadsheet would produce the exact same 154 changes in the exact same pattern that one accidental keystroke produces is, for all practical purposes, zero.

Allegation 1: The SPSS Logs and the Lost Forensic Image

This was the only allegation within the six-year limitations period, and on the merits it is the weakest of the four. The paper, published in 2020, involved a study in which Professor Gino had analyzed data on January 24 of that year. Harvard claimed that on that single afternoon she had downloaded the raw Qualtrics data, cleaned it herself, and made 1,066 changes to strengthen her results. The Committee found that all 1,066 strengthened the hypothesis. In fact, 39 percent did not.

The Committee asserted that Professor Gino had spent the afternoon “running commands in a manner consistent with repeatedly altering the data and then checking whether it improved the results.” This is the language of someone who has read the SPSS log file and found evidence of tampering. The log file shows nothing of the sort. Of 61 distinct analysis commands Professor Gino ran across three SPSS sessions that day, only six occurred more than once for the relevant dataset. Four ran twice, two ran three times. In one of those repetitions, she ran the identical command twice in a row without reloading the data file. Reloading is what you would have to do if you were modifying the data in Excel and testing whether the change had the desired effect. She did not reload. The behavior is consistent with a busy researcher exploring her data, not with a fraudster iterating tweaks. Her calendar that day shows a meeting and a school pickup. Her youngest child was two and a half months old.

Beyond the SPSS log, the central factual claim of the allegation, that she downloaded and cleaned the data herself on January 24, was contradicted by her own research assistant, Alex Rohe, who testified he had cleaned the data and provided her a file before that date. A reference to a file with the letter R appended to its name appears in the directory of her computer, consistent with her practice of naming data files by the initial of the research assistant who prepared them. Even Harvard’s own forensic expert at the tenure hearing agreed that the file she analyzed on January 24 was not freshly downloaded from Qualtrics. Two forensic experts, on opposite sides, agreed. The Hearing Committee ignored both.

And then there is the punchline. When the investigation began in October 2021, the Research Integrity Officer told Professor Gino in writing that Harvard had taken a “forensic copy” of her computer. A forensic copy is a complete bit-by-bit image of the hard drive, including the operating system logs that record which files were opened from which sources and at what times. Twenty-six months later, when Professor Gino’s forensic expert finally got to examine what Harvard had preserved, he discovered that no forensic image had been made. A technician had copied a selection of files. The operating system logs, which could have shown definitively whether the data Professor Gino worked with on January 24 came from a thumb drive given to her by Alex Rohe or from a fresh Qualtrics download, had since been overwritten in the normal course of operation. The single piece of evidence that could have ended the dispute had been destroyed by the very investigation that depended on it. At the hearing the Research Integrity Officer was unable to remember conversations adverse to Harvard and blamed Professor Gino for the limited scope of the file copy, claiming he had asked her to designate files in order to protect her privacy. He had copied a folder containing her tax returns.

What Should Happen

The story of how Harvard reached the wrong result is not a story of malice. It is a story of an institution that panicked at a complaint from a respected outside group, drafted a procedure designed to dispatch the problem quickly, and then could not stop the bureaucratic momentum it set in motion. Each new layer of the process had every reason to defer to the layer beneath it. The Hearing Committee deferred to the Investigative Committee. The President deferred to the Hearing Committee. Each was looking at conclusions the layer below had already reached, and reversing those conclusions would have required confronting the embarrassment of having reached them.

At a minimum, Harvard should acknowledge that three of the four allegations should never have been investigated under its own rules. As for the fourth, the one allegation timely within the limitations period, the university should submit the record to an independent fact finder, perhaps a former judge, capable of evaluating whether the evidence meets the clear and convincing standard. It cannot. And when that conclusion is reached by someone outside the chain of deference, Harvard will have to find the courage to admit what its own record makes plain. Between error and fraud, the vast majority of cases will be error. The protection against confusing the two is process. Harvard failed at process. It owes Professor Gino, and the standard it set for itself, the integrity to say so.

Full summary · ~25 min

The Anatomy of an Injustice

By Gemini

By Gemini · Why Harvard’s judgment against Francesca Gino cannot stand

In May 2025, Harvard University did something unprecedented in its nearly 400-year history: it revoked the tenure of a faculty member. Francesca Gino, a star behavioral scientist at the Harvard Business School (HBS), was stripped of her career following accusations of academic data fraud across four published papers. To justify this career-ending penalty, Harvard’s own rules required proof of intentional manipulation by Professor Gino herself by “clear and convincing evidence”—a rigorous legal standard demanding a highly probable, almost unshakable certainty of guilt.

Any lawyer or scholar will recognize that “clear and convincing evidence” is an extraordinarily high bar. It plainly requires clear proof of a wrong, leaving the fact-finder with a firm belief or conviction of guilt, absent serious or substantial doubt. Yet, a careful examination of the 2,500-page evidentiary record reveals a chilling reality: Harvard did not uncover a fraudster. Instead, driven by institutional panic, it orchestrated a structurally rigged prosecution. It replaced established faculty rules with unvoted protocols, crippled Professor Gino’s right to a defense, destroyed exculpatory evidence, ignored data anomalies that actively weakened its own case, prosecuted time-barred charges, and willfully ignored physical proof of innocence.

Harvard’s judgment is not a triumph of academic integrity; it is a catastrophic failure of due process.

Part I: A Process Designed to Disable a Defense

The institutional response by the Harvard Business School administration was born out of panic. In July 2021, Data Colada—a blog run by three behavioral scientists investigating empirical academic work—contacted HBS. They informed the school that they had identified data anomalies in four papers co-authored by Professor Gino.

Crucially, Data Colada proceeded contrary to its own stated principles. Their public policy is to share drafts of investigative findings with the authors themselves to solicit feedback seven days before going live. They did not do that here. Instead, they went directly to her employer, issuing a clear threat to publish their posts unless HBS intervened immediately, warning that their sources would “take matters into their own hands.”

Terrified by the prospect of public embarrassment on social media, the HBS Dean bypassed the faculty’s standard misconduct protocols. HBS already possessed a research misconduct policy—a short, two-page document debated and adopted by the faculty. Over the summer of 2021, entirely behind closed doors and without faculty consultation or vote, the administration drafted a brand-new, 16-page policy explicitly designed to govern Professor Gino’s case.

This newly manufactured procedure introduced two draconian, highly consequential conditions that systematically disassembled Professor Gino’s capacity to defend herself.

1. The Weaponization of Confidentiality (the Gag Rule)

Under threat of immediate termination, the administration placed Professor Gino under an absolute gag order. For nearly two years during the HBS investigation, and later during the university tenure proceedings, she was strictly forbidden from discussing the charges with anyone outside of two designated advisors.

This gag rule was a structural disaster for truth-seeking. While popular media outlets published detailed accounts of the accusations, Professor Gino was barred from defending herself in public, offering her side of the story, or providing a narrative defense.

More devastatingly, the gag rule severed her from her own research ecosystem. In the field of empirical behavioral science, senior professors run joint research operations utilizing a massive team of collaborators. Professor Gino had published roughly 140 papers involving over 500 individual studies and 120 co-authors. Crucially, the day-to-day handling, merging, and “cleaning” of spreadsheets was executed by a rotating roster of more than 60 paid research assistants (RAs) and gap-year students.

Because of the gag order, Professor Gino was prohibited from speaking to the very research assistants who had manually compiled, formatted, and processed the spreadsheets years prior. She could not ask them how data files were merged, what exclusions were made, or how formatting errors might have occurred. She was locked in an administrative cage, forced to answer hyper-technical data queries entirely from memory, while the ground-level operators of her data spreadsheet factory were placed entirely out of reach.

2. The Expert Ban and Inequality of Arms

The complexity of evaluating modern statistical datasets requires highly technical expertise. Recognizing this, the HBS Investigative Committee (IC) hired an expensive digital forensics firm, Maidstone, to analyze Professor Gino’s hard drives and compile a prosecution report.

Yet, when Professor Gino requested the ability to hire her own technical data expert to interpret and rebut Maidstone’s findings, the Research Integrity Officer (RIO) flatly denied her request. She was told she had “used up her two slots” on her initial advisors.

This is a fundamental violation of due process. In any legitimate judicial context, one party cannot be granted exclusive access to high-powered technical evaluations while the accused is intentionally blindfolded. Professor Gino was forced to attend an intensive Zoom interrogation with the IC where they screen-shared highly complex, hundreds-of-pages-long Maidstone reports that she had been given only two weeks to review without expert guidance. On the spot, she was expected to provide immediate explanations for microscopic discrepancies within data tables she had not looked at in years.

When she logged off the call and wept from pure exhaustion and sadness, the administration interpreted her inability to provide instant technical explanations as an implicit acknowledgment of guilt.

3. The Spoliation of Digital Evidence

The procedural behavior of the HBS administration moved from incompetent to actively deceptive during the seizure of Professor Gino’s electronic devices. On October 27, 2021, the RIO ordered Professor Gino to immediately turn over all university-issued laptops and computers. On November 4, 2021, the RIO assured her in writing:

“All the electronic files included in the inventory are forensic copies, and the original sources remain available to you.”

This statement was a devastating falsehood. Taking a “forensic copy” or an “image” of a hard drive is a baseline requirement of technical investigations (Forensics 101). A true forensic image captures the entire disk state, preserving system logs, metadata, USB connection histories, browser cache logs, and remnants of deleted files. These operating system logs act as an un-fakeable, objective spy in the room, proving exactly which files were modified, by whom, from what source, and at what precise millisecond.

Harvard’s investigators never took a forensic image. Instead, they had a general IT technician turn on the machine and manually copy a narrow selection of 343 files from the hard drive onto an external drive, explicitly ignoring the rest of the machine’s architecture.

Because computers continuously write over old data, allowing a machine to run without taking an immediate partition snapshot causes its automated system logs to be permanently erased. By the time Professor Gino was finally permitted to hire independent forensic experts 26 months later during her tenure appeal, Harvard’s catastrophic mistake had caused the operating system logs to flush entirely. The irreplaceable digital trail that could have objectively vindicated Professor Gino by proving she did not download or manually tamper with the raw data files was permanently destroyed by the university’s own hands.

4. The Bait-and-Switch Appeal Process

The consequence of this structural blindness became glaringly obvious once the HBS internal investigation concluded in March 2023. The IC declared Professor Gino guilty of intentional academic misconduct, and the Dean subsequently stripped her of salary, health care, and campus access, requesting that the university initiate “Third Statute” proceedings to formally strip her of tenure.

Once freed from the initial HBS restriction, Professor Gino immediately retained independent forensic data experts. Working exclusively with the 2,500-page record compiled by HBS, her experts thoroughly demolished the technical findings of the Maidstone report. They exposed mathematical errors, overlooked data parameters, and structural gaps so severe that Harvard’s legal representatives quietly made an extraordinary tactical pivot: they abandoned the Maidstone report entirely.

Think about this structural absurdity: HBS had branded a tenured professor a fraud, ruined her reputation globally, and placed her family under severe financial duress based on a forensic report that the university’s own lawyers admitted was unsustainable under scrutiny.

Instead of pausing to reconsider whether their finding of guilt was fundamentally flawed, Harvard doubled down. In August 2024, midway through the tenure revocation process and after Professor Gino had spent 14 months and over $2 million drafting her defense against the original HBS charges, Harvard dropped a brand-new, 230-page expert report authored by Stanford Professor Jeremy Freese. Freese introduced entirely new data assertions, and hypothetical “falsification scenarios” never before reviewed by any HBS committee.

The appellate Hearing Committee refused to grant Professor Gino an extension of time or allow further discovery to process this structural shock. She was given barely 30 days to review, test, and write a comprehensive technical rebuttal to a completely new 230-page prosecution case, and only two days of oral hearings to present her entire defense. Rather than acting as a traditional appellate body reviewing a lower tribunal’s record, the Hearing Committee transformed into a primary, de novo fact-finding court, evaluating brand-new technical prosecution theories under an aggressive, impossible timeline.

Part II: Prosecuting Time-Barred Claims

The structural failure of institutional integrity extends to a blatant, bad-faith misapplication of Harvard’s own rules governing the passage of time. Due process universally dictates that individuals cannot be forced to defend themselves against ancient accusations. Over a decade, physical evidence degrades, intermediate electronic files are deleted, email servers flush their historical databases, and memories fade into complete obliqueness.

Recognizing this baseline of human fairness, Harvard’s explicit research misconduct policy incorporates a strict six-year statute of limitations. The rule, adopted directly from federal Department of Health and Human Services guidelines, states clearly:

“An allegation about research that is more than 6 years old cannot be investigated…”

Of the four papers brought against Professor Gino, three were ancient history. At the time of the investigation, Allegation 4 was 9 years old; Allegation 3 was 7 years old; and Allegation 2 was 6 years old. Only Allegation 1 fell within the legal six-year window. Under any straightforward textual reading of the university’s rules, 75% of the prosecution’s case was dead on arrival.

To bypass this limit, Harvard’s administration weaponized a narrow, complex exception clause in the policy. The limitation period does not apply if a scholar has:

“…continued or renewed an incident of alleged research misconduct through the citation, republication, or other use for the potential benefit of the respondent of the research record in question.”

In an act of administrative overreach, Harvard argued that because Professor Gino had included these old papers in standard, routine “string citations” or literature reviews within her subsequent work published within the last six years, she had “renewed” the initial alleged fraud.

This interpretation is an absurdity that completely eviscerates the purpose of a statute of limitations. Active academics are expected to continuously reference their foundational body of research when writing modern literature reviews. If merely citing an old paper on a university CV, a personal website, or in a background footnote of a new paper dynamically resets the clock on an investigation forever, then no academic is ever free from the burden of archiving structural records for life.

The drafters of the original federal regulation recognized this precise structural loophole. In September 2024, the rule was explicitly clarified to state that the subsequent-use exception applies only if the author explicitly makes a “citation to the portion or portions of the research record alleged to have been fabricated” to actively benefit subsequent research findings.

An objective review of Professor Gino’s background work confirms she never cited or relied upon the specific, micro-level data points under investigation. She simply cited the broad existence of the historical papers in routine professional passing. Yet, Harvard ignored this clarification, forcing Professor Gino to spend over $3 million in personal retirement and children’s education funds trying to forensically defend data operations that occurred nearly 15 years prior, where the underlying institutional servers had long since been decommissioned.

Part III: The Collapse of the Substantive Case

When the dense mathematical obfuscation is stripped away from the final 11-page report issued by the Hearing Committee—a document entirely clean of a single direct citation to the evidentiary record—the university’s substantive case completely collapses.

Harvard’s entire architecture of guilt relied on a singular, foundational assumption: The Motive Inference. The Hearing Committee asserted with absolute confidence that all identified data anomalies across all four papers pointed exclusively in the direction of Professor Gino’s stated hypotheses. Because only the principal investigator stands to professionally benefit from “juiced” or highly significant p-values, they inferred that Professor Gino must have been the entity who manually altered the files.

This premise is flatly, demonstrably false. In every single one of the four papers at issue, a massive proportion of the data anomalies either completely neutralized statistical significance, actively weakened the study’s findings, or altered cells that were entirely irrelevant to the final published analysis.

A bad-faith actor seeking to fabricate an academic career does not intentionally structure data anomalies that sabotage or dilute their own conclusions. The presence of non-directional, multi-variant formatting chaos across these files points directly away from malicious fraud and directly toward systemic human error in data formatting and management.

To understand how these anomalies actually manifested, one must look at the technical mechanics of each individual allegation.

Allegation 4: The Physical Receipts (the 2012 paper)

The oldest charge in the university’s dragnet involved a study conducted in July 2010 at the University of North Carolina evaluating the psychological effect of signing an honesty pledge at the top of a tax form versus signing at the bottom.

Harvard’s entire case rested on a digital comparison between two Microsoft Excel files: “File A” (a spreadsheet sent to Professor Gino by her UNC lab manager on July 16, 2010) and “File B” (the final data file utilized to run the published statistical analysis). Maidstone identified 73 cell value differences and three additional rows of data between the two files. Because File B produced clean, highly significant results matching the published paper, Harvard claimed Professor Gino had taken File A and manually altered the numbers to create File B.

This claim required the baseline assumption that File A was a perfect, finalized, and immutable representation of the raw data collected in the physical laboratory. It was not. In July 2010, this study was conducted entirely in real space on physical sheets of paper. Paid undergraduate research assistants were responsible for collecting the paper tests and manually transcribing the numbers into Excel.

In late summer 2023, while unpacking old, long-forgotten storage boxes delivered to her garage by HBS movers after she was placed on leave, Professor Gino made a stunning discovery: she still possessed the original physical paper payment receipts signed by the actual human subjects who sat in the UNC lab in 2010.

Professor Gino’s defense team painstakingly cross-referenced the human names, subject IDs, and exact cash payment distribution fields on those physical receipts with the digital parameters of the files. The results were definitive.

Data parameters assessed	File A (Harvard baseline)	File B (published file)	Physical human receipts
Participant alignment	Discrepancies in subject counts	Perfect match	Perfect match
$7 cash payment field	Claims 3 subjects received $7	Accounts for 2 subjects	Documents 2 payees total
$16 cash payment field	Claims 14 subjects received $16	Accounts for 11 subjects	Documents 11 payees total

The physical proof demonstrated that File A was an incomplete, error-ridden working draft compiled by an RA mid-study. File B was the only file in existence that accurately accounted for the real human beings who actually participated in the room. Professor Gino had not changed the data; the lab staff had simply finished transcribing the incomplete paperwork after File A was saved.

Astonishingly, the appellate Hearing Committee completely ignored this physical baseline. In their final written determination, the committee stated:

“Professor Gino claims to have reviewed the original paper receipts completed by study participants and verified that the later data on which her analysis relied are accurate. She did not, however, provide those receipts…”

This statement is flatly, shockingly false. The actual scanned copies of the physical paper receipts were explicitly submitted into the university’s formal record, where copies of the receipts were documented at Exhibits RX 626A and RX 626B. The part-time faculty panel simply failed to read the critical evidence sitting in their own docket, convicting a professor of fraud by explicitly denying the existence of physical documents logged right in front of their eyes.

Allegation 3: The Mechanical Turk Cheaters (the 2014 paper)

Allegation 3 focused on a 2014 paper titled “Evil Genius: How Dishonesty Can Lead to Greater Creativity,” which explored whether breaking rules triggers a subsequent cognitive boost in creative problem-solving. In Study 4, online subjects were asked to predict a virtual coin flip, record their guess, execute the flip, and self-report whether they guessed correctly. Because the study paid more cash for correct guesses, subjects faced an immediate incentive to lie.

Harvard’s investigators discovered that 12 specific participants had recorded a value of “0” in the survey field indicating they had not guessed the coin flip correctly—meaning they told the truth about losing the game. Yet, in the final dataset column explicitly marked “cheated,” these same 12 individuals were coded with a value of “1” (marked as cheaters). Harvard asserted that Professor Gino had manually switched these 12 honest participants into “cheaters” to artificially boost the correlation between cheating and creativity.

This charge represents a total failure to comprehend how online behavioral studies operate. The data for this study was not collected from controlled students on a university campus; it was crowdsourced online using Amazon Mechanical Turk (MTurk). MTurk workers are digital gig laborers paid micro-fees based on tasks completed. Critically, MTurk workers live or die by their platform reputation ratings; if an academic platform flags a worker as a “bot” or an “inattentive responder” who fails basic quality checks, their account rating drops, blocking them from future employment.

When Professor Gino’s independent defense team went back to analyze the underlying characteristics of these specific 12 participants, a clear, logical pattern emerged.

Following the coin flip, all survey participants were given a rigorous cognitive test called the Remote Association Test (RAT), consisting of 17 distinct, objective word puzzles. The 12 suspect participants achieved an astronomical mean performance score of 11.2, compared to a baseline average of just 7.7 for the rest of the general population—a statistical variance that is significant (p = .02). Astonishingly, 100% of these 12 individuals perfectly solved highly complex, specific puzzles that stumped over 30% of the entire study group.

But when these same 12 individuals were evaluated on the subsequent “Usage Task”—which asked them to write out original, creative uses for a newspaper under a one-minute timer—their creativity scores completely flattened to the dead average of the group (6.83 versus 6.61, p = .77).

Why would someone be an absolute cognitive genius on word puzzles, but completely uncreative on an open-ended writing prompt?

The answers to the RAT puzzles are fixed, static, and highly indexed across the public internet. Anyone can open a secondary browser tab, type the three word prompts into a search engine, and instantly copy the correct answer. By contrast, an open-ended writing prompt under a rolling visual timer cannot be pulled from a Google search page.

The Qualtrics metadata logs provide the final, definitive link. The 12 suspect participants took an average of 436 seconds to complete the puzzle page, compared to just 340 seconds for the rest of the population (p = .07). They took significantly longer because they were actively gaming the system: reading the prompt, opening a separate browser tab, running a search engine query, copying the digital answer, and pasting it back into the survey to protect their Amazon worker profile rating.

They were actively violating the explicit, bolded instructions on the survey:

“Please do not use any help other than your own knowledge.”

The existence of two distinct columns in the spreadsheet—“reported guessed correctly” and “cheated”—was not a sign of data tampering; it was standard data architecture. The “cheated” column was structured to allow a data analyst to run a multi-dimensional check, capturing both the coin-flip liars and the MTurk workers who forensically cheated on the timer parameters.

An undergraduate research assistant working through the raw files would look at these massive timing delays and perfect puzzle scores, recognize platform cheating, and manually code the variable field to “1” to filter the study accurately. Harvard convicted Professor Gino of fraud because she possessed data columns that successfully caught online scammers—and they did so without ever interviewing a single one of the five research assistants who ran the file operations over 500 days.

Allegation 2: The Scammer and the Excel Drag Error (the 2015 paper)

Allegation 2 centered on a 2015 paper, “The Moral Virtue of Authenticity,” which utilized a Qualtrics survey distributed to Harvard College undergraduates. Data Colada initially flagged 20 anomalous entries within the spreadsheet where the demographic column asking for “year in college” contained the text string value “Harvard.” Data Colada reasoned that no real undergraduate would answer a query about their class year by typing the name of the university they currently attend. HBS immediately assumed Professor Gino had fabricated these 20 rows out of whole cloth to boost her sample size.

This accusation collapsed under professional forensic IT scrutiny. Professor Gino’s defense experts extracted the hidden Qualtrics digital metadata headers that Harvard’s internal team had completely missed because they failed to read the software manual.

The metadata revealed a clear digital footprint: all 20 suspect rows were submitted from an archaic, long-outdated browser configuration—Firefox Version 5 running on a 2001 Windows XP operating system. In 2014, Firefox was already tracking at Version 27, and Professor Gino worked exclusively on an Apple Mac ecosystem.

Furthermore, the entries originated from 20 entirely separate IP addresses routed through commercial overseas data hosting centers using an automated proxy switcher. The entries completed sequentially, with millisecond precision, one immediately after the prior row ended, utilizing randomized, non-existent free email addresses.

The 20 rows were not a fabrication by Professor Gino; they were an automated attack executed by an offshore click-farm bot designed to bypass survey gates to harvest free $10 Amazon gift cards. Confronted with this undeniable technical reality, the appellate Hearing Committee was forced to fully concede the point, formally declaring Professor Gino completely innocent of the primary charge that had launched the entire global scandal.

Yet, having tracking data that proved their core accusation was a complete mistake, Harvard refused to withdraw the prosecution. Instead, they shifted focus to a cluster of 154 cells that showed numeric changes between an intermediate working version of the dataset and the final public upload. Harvard’s new expert, Professor Freese, constructed a hypothetical “falsification scenario” asserting Professor Gino had manually tweaked these 154 specific cells to align with her hypotheses.

Professor Gino’s defense experts mapped the geometry of these changes and discovered something vital: the changes were not randomly or strategically distributed among the 2,455 available conclusion-altering cells in the sheet. Instead, they were perfectly constrained within a solid, continuous geographic block measuring exactly 9 columns wide by 8 rows deep.

This exact geographic pattern matches a standard structural function unique to Microsoft Excel called “Cut and Insert” (Shift-Drag).

In behavioral science research, undergraduate RAs are continuously forced to manually merge datasets because lab data and online survey data arrive in completely mismatched column formats. If an assistant highlights a block of data to realign rows, but inadvertently holds down the Shift key on their keyboard while dragging their mouse, Excel automatically executes a structural swap. It lifts the target block but simultaneously takes whatever data was sitting at the destination drop point and drops it back into the origin cells—silently trading their places without throwing an administrative pop-up warning.

Professor Gino’s experts proved that a single, accidental Shift-Drag gesture executed by an RA trying to line up mismatched columns account for over 94% of the entire cell variance identified by Harvard. The remaining 6% was perfectly accounted for by a secondary common command shortcut: Control-Drag (Copy and Replace) executed exactly twice.

The mathematical probability that Professor Gino went into a spreadsheet and manually tinkered with 154 individual cells—including making completely irrelevant alterations to 74 cells that had no bearing on professional publication metrics—in a geometry that accidentally, perfectly matched a standard Microsoft software keyboard glitch is practically zero.

The Hearing Committee was presented with a deterministic, replicable software formatting explanation that completely cleared the accused. They simply chose to disregard it, preferring to assert that Professor Gino was an intentional fraudster who manually structured her cheating to look exactly like an accidental keyboard error.

Allegation 1: The SPSS Logs (the 2020 paper)

The final allegation against Professor Gino was the only timely claim under the university’s rules, involving a 2020 paper published in the Journal of Personality and Social Psychology. Harvard’s prosecution team claimed to have found a definitive “smoking gun” within the metadata of the statistical analysis program SPSS.

Professor Freese asserted that on January 24, 2020, Professor Gino downloaded a raw, clean dataset from Qualtrics and spent an entire afternoon executing a highly calculated loop of deception: tracking her micro-adjustments until she achieved maximum statistical significance.

This assertion was a complete fabrication that relied entirely on the assumption that the panel members would not actually audit the 63,000 lines of underlying SPSS code. Professor Gino’s defense team stripped the automated software formatting and reviewed the command syntax line-by-line. The actual log architecture proved the looping theory never happened.

Across three entirely separate SPSS sessions spanning that afternoon and evening, Professor Gino had four completely separate datasets open simultaneously, representing three entirely different research projects. She ran a total of 61 distinct data analysis commands. Out of those 61 commands, only six occurred more than once for the study under investigation. Four occurred exactly twice, and two occurred exactly three times.

Crucially, the logs document that when Professor Gino ran a duplicate variance analysis (UNIANOVA), she issued the command twice in immediate succession without ever reloading the data file. If a researcher is changing cell metrics within an external Excel sheet and testing the impact of those alterations in SPSS, the software cannot see the changes unless the analyst explicitly re-imports the updated file layout. Issuing a command twice on an un-reloaded file yields the exact same numeric output down to the last decimal place.

Professor Gino was not executing a calculated loop of fraud; she was engaging in routine, messy data exploration—toggling between filtered sample subsets, adjusting baseline control variables, getting interrupted by administrative office meetings, picking up her children, and occasionally re-running a calculation command because it was faster than scrolling through pages of code history. Her youngest child was just two and a half months old at the time. The SPSS logs perfectly capture the fractured, distracted work pattern of a busy working mother and academic managing multiple projects—not an evil genius calculatedly juicing a spreadsheet.

Prosecution’s claim: Gino downloaded raw data on Jan 24 to clean it herself. — Forensic reality: Her primary RA explicitly testified that he had already cleaned and prepared that exact dataset prior to Jan 24.
Prosecution’s claim: 1,066 cell changes were strategically engineered to prove a motive. — Forensic reality: 415 of those 1,066 cell changes (39%) altered data fields that were completely irrelevant to the published hypothesis.
Prosecution’s claim: Forensic experts agreed with the committee’s timeline. — Forensic reality: Both Harvard’s own forensic consultant and Gino’s expert formally testified that she worked from an existing file, not a raw Jan 24 download.

Harvard’s Hearing Committee chose to completely throw out the aligned testimony of both forensic experts, reject the explicit statement of the research assistant who ran the file operations, and overlook the fact that 39% of the file data changes were functionally useless to publication metrics, all to preserve an administrative predetermination of guilt.

Conclusion: No Deference Without Due Process

When public courts extend judicial deference to internal university tribunals, it is based on a foundational legal assumption: that the institution operated in structural good faith and adhered to the baseline requirements of procedural fairness.

Harvard University did not.

Driven by institutional panic and a desperation to placate internet data vigilantes, the university administration actively dismantled its own constitutional guardrails. They locked a star scholar in an administrative cage under a two-year gag rule, stripped her of the technical parity required to mount an expert defense, destroyed the digital hard drive logs that could have objectively proven her innocence, ran past their own explicit six-year statute of limitations, and blatantly refused to acknowledge physical evidence explicitly logged in their own trial record.

Between administrative error and malicious fraud, the vast majority of discrepancies in complex, multi-merged scientific files are born of human formatting error. Harvard took routine spreadsheet processing anomalies—born of offshore gift-card click-bots, Amazon Mechanical Turk gig-worker habits, and standard Microsoft Excel realignment glitches—and twisted them into evidence of personal, intentional culpability.

The pursuit of truth (Veritas) cannot be severed from procedural fairness. Francesca Gino was not stripped of her tenure based on clear and convincing evidence. She was stripped of it by bureaucratic inertia, administrative pride, and a catastrophic failure of institutional courage.

This unprecedented judgment is an embarrassment to a great university, a severe threat to academic safety nationwide, and a structural injustice that cannot be allowed to stand.

Videos

02 / Watch

This section includes AI-generated video overviews. The content is compelling but can get a bit weird at times; we offer it weirdness and all. A transcript correcting any errors is provided for each.

Insight Engine — Anatomy of an Academic Conviction

Generated with NotebookLM

Transcript

The Burden of Proof

Generated with NotebookLM

Transcript

The Evidentiary Burden

Generated with NotebookLM

Transcript

AI-Generated Podcast Summary

03 / Listen

This section includes an audio summary of the case, generated by NotebookLM. This is roughly the equivalent of the long written summary, but produced as a two-host conversation (and so much more compelling).

Listen to the AI podcast summary

Hosted on NotebookLM · opens in a new tab

A two-host conversation walking through the case in roughly the same scope as the long written summary.

Transcript

Explore in NotebookLM

04 / Interact

Ask questions of the source material directly. The NotebookLM has been loaded with the podcast transcripts and supporting documents — and any new artifacts produced will appear there as well.

Open the NotebookLM

Browse the sources, view auto-generated artifacts, or ask your own questions in chat.

Open →

The Full Podcast

05 / Go Deep

Nine episodes covering the full range of the case — The Law, Such As It Is. This is for listeners who want the complete record after working through the AI artifacts above. You can read the transcripts by clicking the tab at the top of the page.