GuideLast updated April 20, 2026

Research Methods Guide: Study Design, Statistics, and Reporting

Q: Do I need ethics approval for a literature review or systematic review?

Most institutions consider systematic reviews of already-published literature exempt from formal ethics review because they do not involve new human-participant data. Always check your institution's policy: some require a brief notification or expedited review, particularly when individual-participant-data (IPD) meta-analysis is planned. Primary research involving human participants, their data, or biological samples always requires full ethics approval.

Q: How do I calculate sample size?

Sample size depends on effect size, desired power (typically 0.80), significance level (usually 0.05), and the statistical test you intend to run. Cohen's seminal text remains the standard reference for power calculations across the major test families. Tools such as G*Power, the R `pwr` package, and PASS implement these calculations. Pre-specify the sample size in the protocol before data collection begins, not after.

Q: Should I use a parametric or non-parametric test?

Use a parametric test (t-test, ANOVA, linear regression, Pearson correlation) when the data are continuous, approximately normally distributed, and the sample is reasonably large. Use a non-parametric alternative (Mann-Whitney, Kruskal-Wallis, Spearman correlation) for ordinal data, small samples, or skewed distributions. Always check assumptions explicitly rather than defaulting to parametric methods because they are familiar.

Q: What is the difference between pre-registration and a registered report?

Pre-registration is a public, time-stamped record of the protocol (typically on OSF, AsPredicted, or PROSPERO for systematic reviews) made before data collection or analysis begins. A registered report goes further: the protocol is peer-reviewed and provisionally accepted by a journal before the study is conducted, with publication contingent on adherence to the protocol rather than on the results.

Q: Do I need to comply with GDPR if I am not based in the EU?

GDPR applies to any research collecting, processing, or storing data on EU residents, regardless of where the researcher is based. HIPAA applies to US protected health information. The Australian Privacy Principles (APPs) cover personal information held by Australian organisations. If your research crosses jurisdictions, you may need to satisfy multiple frameworks, and the strictest of them in practice sets the floor.

Q: What is the right reporting standard for my study design?

Use CONSORT for randomised controlled trials, STROBE for observational studies (cohort, case-control, cross-sectional), COREQ for qualitative research, TRIPOD for prediction-model studies, and PRISMA 2020 for systematic reviews and meta-analyses. The Equator Network catalogues hundreds of reporting standards and is the canonical entry point for finding the one that fits your design.

From ethics applications to statistical analysis: foundational knowledge for rigorous research across all disciplines.

By Dr Mitch Bishop, Systematicly Research Lab16 min readStudy DesignStatisticsEthicsReproducibilityReporting Standards

Research study design grid: RCT, cohort, case-control, cross-sectional, qualitative, mixed-methods

Research methods are the structured procedures researchers use to ask questions, collect evidence, analyse it, and report what they found in a way that other researchers can scrutinise and build on. The methods you choose, and how rigorously you apply them, determine whether your findings can support clinical decisions, policy changes, or further investigation. Sloppy methods produce results that even the researcher should not trust.

This guide walks the research lifecycle in order: framing the question and choosing a design, planning the sample, securing ethics approval, selecting the right statistical tests, managing the data and protecting participant privacy, writing for reproducibility, reporting against the appropriate standard, appraising the result, and avoiding the methodological pitfalls that catch new and experienced researchers alike. Every step has documented best practice. The discipline is in following it.

Key takeaways

The research lifecycle moves from question to design to ethics to data to analysis to reporting; each step has documented standards.
Match the study design to the question: RCT for causal claims about interventions, cohort for prospective association, case-control for rare outcomes, qualitative for meaning and process.
Pre-specify sample size and statistical analysis before data collection; post-hoc decisions invite p-hacking and HARKing.
Ethics approval is non-negotiable for human-participant research; literature reviews are usually exempt but always check your institution's policy.
Choose the statistical test from the data's properties (type, distribution, sample size), not from familiarity. Tooling can surface candidate tests but does not replace statistical reasoning.
Report against the standard that fits the design: CONSORT for RCTs, STROBE for observational studies, COREQ for qualitative, TRIPOD for prediction models, PRISMA 2020 for systematic reviews.
Pre-register the protocol, share the data and code where possible, and provide a complete audit trail of analytical decisions.

How Systematicly makes research methods easy

Systematicly supports the analytical and reporting steps of the research lifecycle: feasibility analysis at protocol stage, plain-English statistical-test selection over 1,100+ functions, dual-AI extraction with human-in-the-loop verification, and a full audit trail of every action taken in the project.

Feasibility Analysis at protocol stage: literature volume, heterogeneity signals, timeline and resource estimates before you commit.
Plain English Statistical Analysis: describe the comparison you need in everyday language and Systematicly surfaces the right test from 1,100+ options.
Audit-trail logging for every AI decision, human review, and cross-verification, supporting reproducibility and compliance.

Related deep dives

What are research methods?

Research methods are the structured procedures used to investigate a question and produce evidence that can be appraised, replicated, and built on. They span the full lifecycle: framing the question, choosing a design, planning the sample, securing ethics approval, collecting and managing the data, analysing it, and reporting against an appropriate standard.

Two dimensions distinguish broad approaches. Quantitative methods use measurement and statistical inference to estimate effects, associations, or population parameters. Qualitative methods use interviews, observation, and document analysis to surface meaning, process, and context. Mixed-methods designs combine the two, typically when the question has both a measurable component and a contextual one that quantification alone misses.

Choosing a study design

The study design is the strategy for answering the research question. The choice is constrained by what kind of claim you want to make, what is feasible to collect, and what is ethically permissible.

Randomised controlled trial (RCT). The reference standard for causal claims about interventions. Random allocation balances measured and unmeasured confounders in expectation. Reported per CONSORT.
Cohort study. Follows defined groups forward in time, comparing exposed and unexposed. Stronger for incidence and temporal sequence than for rare outcomes. Reported per STROBE.
Case-control study. Compares cases with controls retrospectively. Efficient for rare outcomes but vulnerable to recall and selection bias. Reported per STROBE.
Cross-sectional study. A snapshot of a population at one time point. Good for prevalence; cannot establish temporality. Reported per STROBE.
Qualitative study. Interviews, focus groups, or ethnography. Answers questions about meaning, process, and lived experience that quantification cannot. Reported per COREQ.
Mixed methods. Combines quantitative and qualitative approaches; used when neither alone answers the question.
Systematic review. A structured synthesis of existing primary studies. The right tool when the question has been studied and the disagreement is in the synthesis, not the primary evidence. Reported per PRISMA 2020.

Compare review designsSystematic Review vs Meta-Analysis: Key Differences ExplainedGuide · March 25, 2026 · 7 min read

Sampling and statistical power

The sample is the bridge between the study and the population. Probability sampling (simple random, stratified, cluster) supports inference to the population; non- probability sampling (convenience, purposive, snowball) does not, but is sometimes the only feasible option for hard-to-reach populations or qualitative work.

Sample-size calculation pre-specifies how many participants are needed to detect a clinically meaningful effect with adequate statistical power, typically 0.80, at a chosen significance level, typically 0.05. Cohen's foundational text^[1] is the standard reference, with tools such as G*Power and the R `pwr` package implementing the calculations. Underpowered studies routinely fail to detect real effects and contribute to the broader replication crisis.

Systematicly | Feasibility Analysis

Viable

Literature Volume

2,847 records

across 3 databases

Heterogeneity

Moderate

I² est. 40 to 60%

Timeline

4 months

with 2 reviewers

Resources

Standard

no specialist skills required

✓ Proceed: sufficient evidence base for systematic review

Feasibility Analysis: estimate whether a review topic is viable before committing months of work.

Feasibility analysis at protocol stage looks beyond statistical power to the broader question of whether the study can realistically be done: is there enough relevant literature to inform the design, is the timeline plausible with the available team, and are the resources and skills required within reach. Honest feasibility work at the front end saves months of wasted effort at the back end.

Ethics applications and IRBs

Research involving human participants, their identifiable data, or their biological samples requires ethics approval before data collection begins. The standard is set by the Declaration of Helsinki^[2] and operationalised through institutional review boards (IRBs in the US), human research ethics committees (HRECs in Australia and the UK), and equivalent bodies elsewhere. ICH-GCP^[3] is the international standard for clinical trials.

Informed consent. Participants must understand what is being asked, what the risks are, and that they can withdraw without penalty. Special protections apply to children, prisoners, and other groups with constrained autonomy.
Risk minimisation. The application must show that risks are minimised and reasonable in relation to anticipated benefits.
Data management plan. A description of how data will be collected, stored, accessed, retained, and ultimately destroyed or shared.
Privacy protections. Compliance with the relevant framework: GDPR (EU), HIPAA (US health), the Australian Privacy Principles (APPs) locally, or the equivalent.

Systematic reviews of already-published literature are usually exempt from full ethics review, but always confirm with your institution. Individual-participant- data meta-analyses (IPD) typically require additional approvals because they involve participant-level data that may be re-identifiable.

Statistical test selection

The right statistical test follows from the data, not the other way around. The selection process is a small decision tree: what type of data (continuous, ordinal, categorical, time-to-event), how is it distributed (normal, skewed, count), how large is the sample, and what comparison are you making (between groups, within participants, association between variables, prediction).

Systematicly | Data Analysis

Is there a significant difference in blood pressure between treatment and control?

Independent Samples t-TestCompare / Contrast

t-statistic

-3.85

p-value

< 0.001

Mean (Tx)

128.40

Mean (Ctrl)

142.10

Cohen's d

0.72

Plain English Statistical Analysis: describe your question in everyday language and Systematicly selects and runs the right test from 1,100+ functions.

Plain-English test selection helps you find the test you already know you need. It does not replace the statistical reasoning behind the choice. The infographic above is illustrative: the input is a question, the output is a candidate test family, and a competent researcher still needs to verify that the assumptions hold and the result is interpretable.

Parametric tests (t-test, ANOVA, linear regression, Pearson correlation) assume continuous, approximately normally-distributed data and are more powerful when their assumptions hold.
Non-parametric alternatives (Mann-Whitney, Kruskal-Wallis, Spearman correlation) handle ordinal data, small samples, and skewed distributions without distributional assumptions.
Generalised linear models extend regression to non-normal outcomes (logistic for binary, Poisson for counts, Cox for time-to-event).
Multilevel and mixed models handle hierarchically clustered data (patients within clinics, students within schools).
Bayesian alternatives for any of the above, when prior information is informative or when posterior probability is more interpretable than a p-value.

Worked exampleHow to Screen Studies for a Systematic Review: A Practical GuideGuide · March 25, 2026 · 9 min read

Data management and privacy

Responsible data management is the operational backbone of reproducible research. It starts with the protocol's data-management plan and runs through collection, storage, access control, sharing, and ultimate retention or destruction. The practices that look like overhead at the start of a project look like obvious prudence by the time of submission, peer review, or audit.

Systematicly | Audit Trail