In this lab, we will continue with two ideas that we introduced in lecture: understanding how data mutation works and testing functions that mutate data.
When a program is maintaining data over time, sometimes we want that data to be shared across different parts of the program and sometimes we do not. Decisions about what should or should not be shared affect how we set up our data and write programs.
Consider a tool (like Google Docs) that allows people to create documents and share them with other people. Here's a basic dataclass for Documents.
When we call something like Document("Homework 1", ...)
, we are using the Document
constructor to create a new piece of Document
data.
datetime
?datetime
is a Python library that provides support for working with dates and times. This comes in handy when you are, for example, trying to take the difference between two times. You can create a new datetime
using the constructor
datetime(year, month, day, hour=0, minute=0, second=0, microsecond=0)
(The =0
means that those inputs are optional with a default value of 0).
Another basic dataclass captures people who are allowed to edit documents:
Imagine that the CS111 homeworks team is preparing assignments that are in different stages of readiness. Some documents are ready for sharing with others, but some stay private until the creators are ready to share them.
Task 1: Copy the above code fragments to a Python file. Add variables and definitions to set up the following set of editors and documents with their corresponding sharing settings:
hwk1
variable from the examples above as part of constructing the Nya Editor
, just like the amaris
example above)Task 2: Draw the memory diagram for the current collection of Editors and Documents.
Consider the following program:
We could draw our memory diagram like this, where each box represents our data in memory. Each box resides in some location in memory (written as locXXXX). When a variable or field references another piece of data in memory, we use the location to identify the piece of data.
If we want, we can highlight the references between variables/fields and data by also drawing arrows that connect a reference to a location with the actual piece of data. The arrows, however, mask a key piece of what is happening: that data live in spots in memory with unique numeric identifiers.
An alternative way of drawing memory diagrams is to use the spreadsheet template we saw in class. The same diagram as above would be represented like this using the spreadsheet:
It is critically important that names from the directory do not appear in the heap area/inside the data. Using the names suggests that later updates to the variable values (like prairie = ...
) would affect the data in the heap, but this is NOT how languages work.
Task 3: Extend your memory diagram to show Amaris being the sole editor of a copy of Alex's current homework 2 document.
Task 4: Add some code to your file that would create the same memory contents as what you just drew to give Amaris a copy of homework 2.
Amaris and Nya have been working on homework 1, but their idea isn’t working out. It’s such a mess that they have decided to just throw away their old attempt and start over.
Task 5: Here are two ways to “start over” on homework 1. What effect will each have on the memory diagram? After taking each approach (separately), what contents will Python produce if we ask to see Nya’s version of hwk1?
One of the most subtle concepts to internalize when starting to work with updating data is the difference between updating variables and updating contents of data structures. The previous task highlights the differences.
Task 6: To help you internalize these details, please talk through the following questions with your partner.
If we update what hwk1
refers to using approach 1, which editors will see the changes?
If we update what hwk1
refers to using approach 1, will there be any way to access the original hwk1
contents?
If we update the contents of hwk1 using approach 2, which editors will see the changes?
If we update what hwk1
refers to using approach 2, will there be any way to access the original hwk1
contents?
Creating, sharing, editing, and copying documents happen so often that it would make sense to have functions corresponding to those tasks.
Task 7: Write a function create
that takes an Editor
and a document title. The function creates a new document, stores it in the editor's list of docs, then returns the new document.
Task 8: Write a function edit
that takes a Document
and a string (for the new content) and replaces the document contents with the new text.
Task 9: Write a function share
that takes a Document
and an Editor
and adds the document to the editor's list of docs.
Task 10: Write a function copy
that takes a Document
and returns a copy of the document. The copy has the same title and contents as the original, but the last_edited
component is the current date and time (use datetime.now()
to get this).
Task 11: Look back at your four functions: which ones return something and which ones don't? Focus on the ones that don't return anything. Do they "accomplish" anything? Can you articulate what a function that doesn't return anything has to "do" in order to be of use? Write down your answer. (hint: think about our append_8
example from lecture)
Now, we have to figure out whether the functions you've written provide the sharing conditions that we had in mind. In particular, think about the edit
, share
, and copy
functions.
Your goal is to fill in this template of a testing function:
Task 12: Identify an initial scenario of editors and documents against which you will run your tests. Create the corresponding data in the "SETUP" area of test_docs
.
Task 13: Write down (in prose) a list of effects that these operations should have. Put these in a multi-line string comment above your test_docs
function in your file:
Task 14: Write down a list of effects that these operations should not have. For example, "if one document is edited, then the contents of another document should not change". Add these in a separate comment like the one above in your file.
Task 15: Make a collection of calls to the edit
, share
, and copy
functions in the "PERFORM MODIFICATIONS" section. These calls should be sufficient to perform the checks that you outlined in your lists of effects.
Task 16: Turn your expected and unexpected effects into pytest assertions, putting those in the "CHECK EFFECTS" section of the testing function.
In a large course staff like we have for 111, having a separate variable for each TA/Prof makes our code a bit hard to manage. It therefore makes sense to make a list to hold all of the editors on our documents, as follows:
To help us keep track of all the documents we are working on, we want to make two additions to our current code: we want to be able to ask which staff have access to a particular document, and we want to maintain a count of how many documents we have.
Let's start with the latter. We'll set up a new variable to maintain the count of documents that we have created. We'll also modify our copy
function to update this count.
Task 17: Create a global variable called num_docs
and modify your copy
code in a function called copy2
to also increment num_docs
whenever a document is copied.
Task 18: The following code computes a list of all staff who can access a specific document.
The who_has
function indents the return
under the for
loop. We could also have indented the return
at the corresponding comments. Discuss with your partner what each return position would result in compared to the original return
position.
Brown University CSCI 0111 (Fall 2023)
Do you have feedback? Fill out this form.