--- title: Homework 5 tags: Homeworks-F24 --- # Homework 5: Much Ado about Lists! **Due:** Wednesday, October 23rd, 2024 at 11:59 PM ET. ### Setup - We are not using a stencil/helper code file for this homework! - Create a file in [Pyret](http://code.pyret.org) called `hw5-code.arr` where you will write your Homework 5 solution. - Create a file in your preferred word processor (Google Docs, Word, TextEdit, WordPad, etc) called `hw5-src.pdf`. This will hold your written work. - Do **not** put your name anywhere in the files. ### Remember your resources! - [Pyret documentation](https://www.pyret.org/docs/latest/) - [Pyret String Documentation](https://pyret.org/docs/latest/strings.html) - [Pyret List Documentation](https://pyret.org/docs/latest/lists.html) - [CS0111 Table documentation](https://hackmd.io/@cs111/table) (use this **instead** of the official `Table` documentation) - [TA hours](https://brown-csci0111.github.io/pages/calendar.html) - [Edstem](https://edstem.org/us/courses/64952) ## The Big Picture of this Assignment This assignment has two goals: to give you some practice writing list functions, and to step back and give you a chance to show us what you have learned about writing clean code and developing a good set of examples for a problem. It's something of a mid-semester checkpoint on your program development skills. We'll be paying particular attention to your code organization and testing in grading this assignment. :::info Learning Goals - To practice writing recursive functions over lists - To practice using map and filter functions - To practice testing list programs - To practice writing your own datatype ::: This assignment draws on some basic ideas about analyzing texts. For some of these problems, we will tell you whether to use the built-in list operations (like `map` and `filter`) or an explicit recursive function (written with cases and based on examples, like we've been doing in lecture). ## The Assignment Barry the Bee is a self-proclaimed bardolator. During his free time, he analyzes William Shakespeare's famous *Much Ado About Nothing*. In this homework, you will use your coding skills to help Barry perform this analysis. <img src="https://static.wikia.nocookie.net/beemovie/images/1/11/Barry-B-Benson.png/revision/latest?cb=20190513100654" alt="barry the bee, the icon" width="40%"> <img src="https://i.pinimg.com/736x/77/e3/1e/77e31e7ba3351999b0b28d4c1467f229.jpg" alt="barry quoting Hamlet holding a skull" width="30%"> ## Part 1: Analyzing Literature Computational tools are helping literature scholars find patterns in text. With the text of many books now available online, we can download that text and use programs to extract data about the text. For this part of the assignment, we will work with the text of the play [Much Ado About Nothing](https://docs.google.com/document/d/1-Y8R3TjHm37mta9QFjoOBlrkysK19y5qajFCveTi8EY). Our goal is to determine the number of lines in the play that are spoken by each character. Specifically, we want to generate a table with two columns: the first column holds names of characters in the play and the second column holds how many lines that character had in the play. There should be only one row per character, but those rows can be in any order. **Unlike with earlier tables work, for this assignment we will be building tables from lists.** We'll introduce the operations that you need as they come up in this assignment. **Task 1:** Copy this code into your `hw5-code.arr` file. The text of the play is defined (in the `shakespeare-text.arr` file) as one long string named `book-text`. ``` include shared-gdrive("shakespeare-text.arr", "1-XGIpbC13ItjxCh7WY14cCjfygk5wz5x") include shared-gdrive("dcic-2021", "1wyQZj_L0qqV9Ekgr9au6RX2iqt2Ga8Ep") ``` To examine the text, we have to convert the single `String` that contains the text to a list of words. **Task 2:** Create a list named `book-words` that contains all of the words in `book-text` in order. In other words, we are converting the single string for the text into a list of the words. You should be able to do this in a single line. Refer to the [Pyret String documentation](https://www.pyret.org/docs/latest/strings.html) if you don't remember the function that we've seen earlier in the semester that does this (professionals refer to documentation to look up functions they've used before all the time). ### Identifying the characters In order to build our table of lines per character, we first need to identify all of the characters in the play. Luckily for us, this play text used character names in ALL-CAPS to label each line (real text is usually messier). Thus, we need to isolate all of the ALL-CAPS names. Since this assignment practices different ways to process lists, we want you write this two ways: once using built-in list operations and once as a recursive function. **Task 3:** Write a function called `find-charactersB` that takes a list of `String` and returns a list of all of the character names in the given list. Your implementation should use built-in list operations and no manually-written recursion. You may write non-recursive helper functions (with or without `lam`, as you prefer). **Task 4:** Now write a second version of this function called `find-charactersR` with the same inputs/outputs, but that uses recursion instead of built-ins. Your output list may contain duplicates (or not, that decision is up to you). **Note:** Remember how we wrote such functions in class: start by writing out the entire list pattern (with the cases and the call to the function on the rest of the list), then try to fill in the holes to finish the function. **Note:** We are not asking for sophisticated text processing here. We're asking you to find words in ALL-CAPS. Don't try to filter out for ALL-CAPS words that might not mark lines, etc. **Our tests assume that you've counted all ALL-CAPS words, plain and simple**. Yes, this includes "I", "O", and "A". **Testing:** You should have several `where` examples for each of these functions (you can use the same examples for both). ### Creating a table of lines per character Now that you have a list of all the names, it's time to create a table summarizing the number of lines per character. We are going to do this using mostly functions over lists. In particular, **do NOT use the `count` or `build-column` functions in the table documentation** (there will be no credit for that solution). **Final table requirements:** the table produced for this part should have one row per character. It should have columns called `"name"` and `"line-count"`. The character rows can be in any order. <!-- :::danger TAs -- for the any-order requirement, our tests should call their function, sort the rows by either name or count, then compare the results. Our version should also strip the "A", "I", and "O" rows (so that they get credit whether or not they thought to remove those lines). ::: --> Roughly, to do this, we're going to create a list of the contents of each column in the desired table, then add those column-lists to the table. Here are functions (that were loaded in the starter code that you copied) that will help you do this: - `create-table-with-col(colname :: String, colvals :: List<Any>) -> Table` <BR> This function takes a column header (like `"name"`) and a list of values and creates a single-column table with those values. - `add-col :: (t :: Table, colname :: String, colvals :: List<Any>) -> Table`<br> This function takes a table and adds a new (rightmost) column with the given colname and values. The list of `colvals` must be the same length as the number of rows (without the header/column name.) In the 10/09 lecture, we went through an example of using these two functions to build up a Table. **Task 5:** Write a function `characters-table` that takes in a `String` with the text of a play and produces a table summarizing the number of lines spoken by each character. Remember that character names in ALL-CAPS are used to label lines spoken by that character. You should not use `count` or `build-column` in this task. **Note:** You may use any combination of built-in operators and recursion to create the lists of column values. Use your code from previous tasks as helpers if you want (and write additional helpers as you need). Write as many helper functions as you need. Sketch out a plan if that would help you, particularly thinking about how the information in each cell of the first column (character names) is used to create each cell of the second column. This is your chance to show what you've learned about writing clean and robust code for a nontrivial problem. **Note:** What should the output of `characters-table` be if the input play text has no characters in it? Make sure the example in the `check` block works! **Task 6:** Develop two good examples of shorter input strings to use to test your `characters-table` function (and use them in a `where` block for that function). Put a comment above each example indicating what situation(s) you designed it to help test. **Note:** To test your function, you can generate a character table then pull out individual cell values to check. This can make your testing much easier and less annoying to write. :::spoiler where-block example starter ``` where: output-char-tab = characters-table(my-input) output-char-tab.row-n(0)["name"] is ... output-char-tab.row-n(0)["line-count"] is ... ... end ``` ::: ## Part 2: Looking at Text in Context Our first text analysis centered around individual words. For some questions, however, we need to look at text in *context*, meaning that we consider the words that occur close to other words. This set of problems looks at how we write programs to consider words in context. Imagine that you are analyzing a text that features two characters named Gabriel (Gabriel Sun and Gabriel Costa). You want to see which one is mentioned more often. To do this, you need to inspect the words immedately *after* "Gabriel" in the text (which could be either one of the characters' last names or some other word). **Task 7:** Use [Examplar](https://pyret.cs.brown.edu/assignment/1NEwuc3YcoJZKf0RwIHUfJfQnsRWeRAzJ) to develop a comprehensive set of tests (in a `check` block) for a function named `count-a-gabriel`, which takes a `String` of main text (multiple words) and a `String` containing a last name to track and returns a `Number`. The function returns the number of times that the given last name directly follows `"Gabriel"` in the main text. Accept any capitalization of "gabriel" and the given last name. **To run the Examplar, press the Run button in the upper right hand corner. Do NOT press the "Begin Implementation" button, doing so will break your Examplar.** *Note: There are 8 broken solutions. Finding 7 will get you full credit (with partial credit for finding fewer than 7; you'll get most of that credit if you find 6).* **Designing a Solution:** Now it's time to think about how to implement `count-a-gabriel`. This is a different style problem than what you've done before, so let's think through how to approach it. As we've discussed in lecture, people program by retrieving/remembering solutions to problems that they've already seen and adapting them to a new situation. We have done problems that count list items before, but not problems that count *surrounding* items. We've seen two approaches to counting occurrences of items: (1) a combination of `filter` and `length` or (2) explicit recursion. :::spoiler Which makes sense here? Think about it before opening this to check your answer. The built-in functions, by design, only look at one element of a list at a time. In this problem, we need the surrounding words, so we have to use explicit recursion. ::: <br> If you already see how to adapt a previous counting approach to write `count-a-gabriel`, jump down to Task 8. Otherwise, read on for guidance. Sometimes, we can successfully adapt code by actually writing out the original version and deciding how to modify it. Set aside checking the last names, and write down the previous approach that you know to just count appearances of `"Gabriel"` in the given string. (We're serious, type it into your file and make sure it works). Now, look at that code and identify what has to be different to count the word after `"Gabriel"` instead. Keep the high-level structure of the code that you have, and decide how you'll need to modify it. **Task 8:** Implement `count-a-gabriel` in your Pyret file. It should take in the same parameters as described in Task 7, and your code should pass the tests that you developed through Examplar. You are welcome to go back to Examplar and add more tests if something occurs to you while you are writing the function. <!-- ### Part 2B: For those going to 200 -- Looking backward too! *This is the first of the bridge problems into CS200. You don't have to do this now (solutions are accepted into January), but you can if you want.* Previously, we looked forwards for which Gabriel (by last name) was being mentioned in the text. This time, we want to look backwards, asking how uses of the last name `"Costa"` distributed across different characters (e.g., Antonio, his brother Carla, etc). To do this, we need a list of all of the words in the text that appear immediately *before* uses of `"Costa"`. For example, processing the string "Mrs. Costa went to Costa Jone's house" would produce the list `[list: "Mrs.", "to"]`. **Task 9:** In `hw5-bridge.arr`, write a function called`find-before-costa` that takes a `String` of text as input and returns a `List` of the words that occur immediately before an occurrence of `"Costa"` in the text. *Note: there are two ways to approach this problem with a recursive list program: either you recur and look ahead to examine the second value rather than the first, or you find a way to keep track of the previous first word and look at the current first word as you recur. We encourage you to consider both and what you see as the tradeoffs of each approach as you write this function.* --> :::info **Interpreting the autograder:** We have made *all* of our autograder tests for `count-a-gabriel` visible for you. Each test *roughly* corresponds to a situation you might have caught using examplar (we say "roughly" here because the tests you write may catch multiple chaffs, and because the order of the buggies as they appear in Examplar does not match the order of the autograder tests). We've done this so that you can decide when to stop working on this problem yourself -- if handling a specific edge case in your implementation is costing you a lot of time, you may choose to sacrifice those one or two autograder points (which will end up being a fraction of a percent of your final grade) and instead focus on your code style, other assignments, or your interests outside of school. ::: Throughout this assignment you have created functions that have helped Barry the Bee analyze a Shakespearean play. Barry is now interested in creating his own Shakespearean play but, since he has limited time, he is considering using artificial intelligence (AI). **Task 9:** Read [this article](https://news.harvard.edu/gazette/story/2023/08/is-art-generated-by-artificial-intelligence-real-art/) about AI-art generators to learn more about the controversy surrounding these online tools and whether or not it’s putting artists, creators, and writers out of work. In a separate document titled `hw5-src.pdf` write your responses to the following questions: * In **3-5 sentences**, referencing the article, to what extent can human creativity be replicated by AI? Explain whether or not Barry should use AI to generate his Shakespearean inspired play. * In **2-4 sentences**, answer the following: to what extent can Barry take full artistic credit for his work if he used AI. (There are no right or wrong answers, but please craft a thoughtful response!) ## Part 3: Creating Datatypes :::warning We will cover the material needed for this part of the homework on Friday, October 18th. These are the shortest tasks in the assignment and do not require much coding. ::: **Task 10:** Create a datatype `FoodItem`. It should have one constructor `food` with three fields:`food-name` (a `String`), `days-until-expiration` (a `Number`) and `keep-cold` (a `Boolean`). *Note: These inputs should be in the same order as written here, in order to work with the autograder. For this question, we're just checking that you can use the syntax from class to define a datatype, nothing fancier than that!* **Task 11:** Create a list named `food-list` containing three `FoodItems` (you choose the details of the items). **Task 12:** Develop a function `is-expired` that determines whether or not a food expired. It should take one `FoodItem` and return a boolean indicating whether the `days-until-expiration` is below zero. ## Check Block (Autograder Compatibility) When you've completed the above tasks, copy the following check block into the bottom of your code file. Then, hit "Run". All the tests should pass: if they don't, this means you have a typo in a function header. ``` check "function name and input checks": exists = lam(f): true end t = table: name :: String, line-count :: Number end check_list = [list:] find-charactersB(check_list) find-charactersR(check_list) characters-table("") is t count-a-gabriel("gabriel hi", "gomes") is 0 exists(food) is true is-expired(food("",100,true)) end ``` ## Handin - Download your coding file and make sure it is called `hw5-code.arr`. - Download your tests file from Examplar and make sure it is called `hw5-tests-examples.arr`. - Download or save your written file and make sure it is a PDF called `hw5-src.pdf`. If your written file is not a PDF, we might not be able to grade it. - Hand in your files on [Gradescope](https://www.gradescope.com/courses/844129). ## Our Inspiration [Barry becoming a working bee!](https://www.youtube.com/watch?v=uC0rgAUPCIM) <img src="https://i0.wp.com/cultfollowing.co.uk/wp-content/uploads/2021/05/bee-movie-review.jpg?resize=1200%2C1125&ssl=1" alt="barry the GOAT" width="50%"> ------ > Brown University CSCI 0111 (Fall 2024) <iframe src="https://forms.gle/52Fi9HdMFRgW1vD67" width="640" height="372" frameborder="0" marginheight="0" marginwidth="0">Loading…</iframe>