Try   HackMD

Git masterclass blog post draft

My github handle

samumantha, bast,

Contributing

Open House notes: https://hackmd.io/@coderefinery/oh_git_masterclass

Please work on any section that sounds interesting to you in this document. Please do not forget to add yourself as an author. Once we have a first full draft we move to Github.

Note that the materials have now been split into two separate blogposts. One more on the organizational side (done, see https://coderefinery.org/blog/open-house-git-masterclass/) and one on the materials and topic collection (help needed; this document).

Rough general todo:

  • introduction
  • existing lesson overview
    • add links with names
    • add more info on the lessons
  • Summarize topics and their coverage
    • Recovering
    • History
    • Collaborative workflows
    • [?] Organizing projects
    • [?] User interfaces and experience
    • Other topics
  • Incorporate "other notes" where fitting
  • Summarize gaps identified, if possible
  • Acknowledgements
  • Outlook, the grand plan

Deadline for first draft is end of January.


Drafting space

+++
title = "Advaced git / git masterclass materials and topic collection"
description = """
Collected materials and topics from our open house session on advaced git / git masterclass on 14.01.25
"""

[extra]
authors = "Samantha Wittke, Radovan Bast, "
+++


TL;DR

TODO: Write summary


Background

On January 14, 2025, we held an Open House session on "Git masterclass". Educators and practitioners from various institutions and communities came to share resources and discuss topics usually not taught in basic git classes. This blogpost provides an overview of the materials and topics discussed. Please refer to https://coderefinery.org/blog/open-house-git-masterclass/ for an overview of the OpenHouse session itself.

Material we know about

Advanced Git in Carpentries incubator (pre-alpha)

Other incubator projects on intermediate and advanced Git

Git - beyond the basics

Heidelberg University material for intermediate/expert Git courses

e-Science Center
Collaborative version control with Git and GitHub

CodeRefinery

Git and GitHub through GitKraken : From Zero to Hero

University of Sheffield - Git Collaboration

KIT - Intermediate git
Mostly CodeRefinery material, but with some additions:

zedif Jena - Collaborative Version Control with Git: An Advanced Workshop

Basic GitHub but including pull requests and branching

Met Office Resources - Custom Software Carpentry Lessons

Other resources

A list of other resources that can be helpful when learning about or teaching git:

Topic collection

Recovering from local mistakes

Changing history

  • How to modify an open pull request/ merge request - responding to code review with changes (rebase, squash, edit commits)

  • How to remove something from the history

    • Start with git revert and how it doesn't actually remove from hisotry. Removing from history completely can be done using BFG-repo-cleaner.
  • Interactive rebase to squash/delete/re-order commits

    • Gradually introduce rebase: Amending/Fixup first, then interactive rebase. Do not overwhelm with full power, links into creating clearer history while working. | +x
    • XKCD comic as motivation to git rebase. This repository implements the git history of that comic and fixes it: https://github.com/ssciwr/git-rebase-xkcd-example (Details in Dominics material)
    • Squashing commits into a logical unit using reset
  • Creating clearer history while working

    • Partial staging and committing
    • Pulling with rebase
  • Temporary work and stashing

    • Small episode in Dominics material
  • Atomic commits, using git commit --amend and git commit --fixup which provide a gentle introduction to git rebase -i (see above) | +3

  • History inspection

    • Which branches and commit have you worked on? (Analysing/Debugging history - Meta Level)
    • Searching through code changes with "pickaxe"
      • Finding a commit that introduced a bug with git annotate or bisect
      • Example repository with two bugs hidden in history (1 functional, 1 performance): https://github.com/ssciwr/git-bisect-example (Details in Dominics material)

Collaborative workflows

Combining changes

  • More advanced merge situations
  • Cherry-picking to apply change from a single commit (as alternatives to merge)
  • Comparing rebasing with merging and understanding when to prefer one or the other
  • Rebasing across conflicts
    • First should cover the two ways of updating branches, either git merge or git rebase and explain why and when you might prefer one of the other. Both can result in conflicts though!
    • Then go on to explain how to resolve conflicts.
    • Some of this is covered in the Diverging branches chapter

User interfaces and experience

  • Nice life hacks in .gitconfig
    • includeIf to have custom configuration based on directory path.
    • using aliases
    • conflictstyle = diff3 under [merge] to not only see a conflict but also what the original code was when resolving
    • defaultBranch = main
    • pushDefault = origin
    • Aliases to avoid long complicated commands
    • how to not reveal your email address if you prefer to not reveal it
  • Using GitHub/GitLab command line utilities (PRs from command lines)
  • Collaborate between Windows users and *nix users (line ending issue)
  • git hooks
  • GitHub actions / GitLab CI/CD
    • running GHA locally: act
    • ssh-ing into failed job runners using tmate and debug the issue on GitHub
  • Git shell prompts
    • Oh-my-zsh (default on OSX) provides nice Git prompts via themes.
    • how to customize fish shell for Git
  • how to use GUI for line-based "git add -p" commands, difftool, mergetool
  • excellent tool for diffing and integrates easily with Git (see git configuration)
    • delta - syntax-highlighting pager that can be customized
    • meld as difftool/mergetool
  • Signing commits can be a pain point as GPG is a bit of a pain to use in and of itself.
    • Can sign with SSH keys these days.
  • How to some ignore files globally for all your repos (these are typically ignore patterns that relate to individual settings rather than ignore paths that would be useful for all)

Organizing projects

Suggestion: this topic belongs on a 'RSE/Data Manager track' of git training, separate from a 'researcher/user track' that focuses more on daily usage, recipes etc - 'history' & 'recovering' sections.

Best practices:

  • How to write a 'proper' git commit message
  • What to do when you have hundreds or thousands of repositories in various states of completion/abandonment, e.g. using GitHub's labels and/or other features
  • Naming conventions

These best practices could be served by e.g. blog posts disseminated via community forums, but they probably don't need to be part of a formal course.

Resources:

  • For keeping track on multiple repositories / places using DataLad could be a solution (based on git-annex)
  • CleanCode as setting a base for the (code in the) repository

Acknowledgement

This collection and overview was developed collaboratively during the OpenHouse session. Thanks goes out to all participants: Radovan Bast, Lukas C. Bossert, Richard Darst, Nishka Dasgupta, Jonathan Hartman, Marc-André Hermanns, Diana Iusan, Dominic Kempf, Christian Knüpfer, Michele Mesiti, Iva Momcheva, Joe Marsh Rossney, Neil Shepard, Jannetta Steyn, Dimitrios Theodorakis.

Outlook

In this Open House session, we looked back: we shared experiences and links to already existing materials. We also preliminary identified topics of interest to the community based on previous experiences.

For the next session, we aim to look into the future: What lessons are missing, where do we put them and how do we collaborate on something useful for the community to mix and match as needed. If you want to be part of this discussion, please contact support@coderefinery.org.