Skip to main content
This assignment is due on Friday, March 15, 2024 before 11:59PM.
Submission Link: https://classroom.google.com/c/NjUwNDE2MzEwMzQx/a/NjUxODU3Mzc5MzIz/details

Project Milestone 2: Project Preparation and Literature Review

For this milestone, you should read papers that are related to your project and finish cleaning data and setting up a baseline. In particular you should:

  • Make progress on the suggested next steps that the class staff left for you on your project proposal.
  • Read at least 3 papers related to your proposed project. (See the Lit Review section)
  • Finish collecting any datasets that you need and make sure they are in a format that you can work with.
  • Outline a more definite plan on the types of methods you want to use. What will your system look like?
    • If your project involves calling external libraries, do some quick experiments to see whether the methods you want to use are feasible.

For those with extra ambitious projects, we suggest doing these bullet points for this milestone as well:

  • If your project involves training/fine-tuning a neural network or training a reinforcement learner, try training some preliminary models and observing the outputs.
  • If your project involves a user experience of some sort, create a mock-up of the user interface (even if it’s not fully functional/feature-complete yet).

Informal Lit Review

For this milestone we would like you to do a short literature review of at least 3 papers relating to your project. We’d encourage you to do more than 3, but 3 is the minimum requirement for full credit.

There are a couple of ways to find papers.

1) To find good papers, we recommend starting with a highly relevant paper and either looking through the paper’s Related Works section or using Google Scholar to find papers that cite it. You can also use Semantic Scholar to do this as they distinguish between “Highly Influential” citations and standard citations, which could help you narrow down your search.

2) You can also look through conference proceedings, such as those listed for the Paper Presentation (e.g., ACL Anthology, or if you’re doing speech: Interspeech, or ICASSP) to search through only papers that were accepted to top NLP conferences. Doing this may help you find some “diamonds in the rough” (i.e. papers that are of high quality but do not have many citations yet by virtue of them being very new).

3) Alternatively, you can use Google Scholar or Semantic Scholar to search by keywords. This might be more difficult if you are not as familiar with the jargon of the area. If you would like to search this way and are having trouble finding the relevant keywords, feel free to reach out to one of us!

We recommend starting with these three methods to find papers then skimming ~10 or more of them before deciding on which three to include. This should ensure that the three you find are relevant.

Note: While the writing itself might be informal for this part, the paper selection and insights should be high quality.

Side note: this isn’t what’s traditionally called a literature review. A literature review usually requires a lot more work. If you are doing the paper writing for the Grad Assignment, that is more of an actual literature review.

Each team should submit a document titled lit_review.pdf which should contain, for each paper:

  • The title of the paper, the year of publication, and a link to the paper.
  • A paragraph with a short summary of the paper’s contributions. (2-3 sentences)
  • A bulleted list of your thoughts on the paper and how it relates to your project.

Proposal

For the main milestone writeup, create a new document which describes the progress you’ve made so far. Feel free to copy from your project proposal where appropriate. Even if you do copy, you might need to fill in more details. Your document should include the following sections:

Title & Author Names: What are you calling your project? These are not graded but still important! (No section header for these bits)

  1. Project Description: This is the “what” and the “why” of your project.
    • The “what”: Explain the main idea of your project and what you are trying to accomplish. It’s okay if these have changed since the project proposal.
    • The “why”: If it’s a research project, what’s your research question (i.e., what are you trying to prove with this work)? If it’s an artifact that you’re creating, why are you creating it?
  2. Related Work: What previous research will you be using, building off of, or being inspired by? Tell a short “story” using what you found out from the Lit Review above.
  3. Methods: What are the steps to accomplish your project? This should describe both the steps you’ve done so far (data collection and system planning) and the ones you plan to do. This doesn’t have to be the final methods that you end up using, but writing out a plan out will help you figure out some details you might not have thought of otherwise!
  4. Data: Describe what your data is in detail. For example, what data have you decided to use? Where did you get it from, what does it look like, and what are some potential problems with it? Are you collecting your own data? If so, do you see any potential problems with the collection or what the data will look like? If you’re using an environment, what is it like? What sort of information does it give your agent?
  5. Evaluation Plan: What will you be using to evaluate your research question or that the components of your NLP system are working well?
    • These can be things like accuracy, diversity metrics, perplexity, BLEU/ROUGE scores, success rate, human evaluation scores, etc. depending on your project.
  6. Preliminary Results (optional): Describe any results you have so far. These can include results for intermediate steps that build toward your main project goal. If you haven’t gotten anything working succesfully yet, negative results or plans for experiments to run are great to report too! This section should include tables, plots, and example system outputs where applicable.
  7. Attribution: For each of your team mates, write a couple sentences describing their main contributions to the project so far and what each person plans to do for the rest of the project.

What to Submit

Submit the following to Google Classroom:

  • group[#]-lit_review.pdf which contains your literature review, where [#] is your group number. This can be in any format you like.
  • group[#]-milestone2.pdf which contains your updated project paper. To make grading easier, your proposal should include section headers corresponding to each of the bulleted points as well. While not required, we highly recommend that you use LaTeX and a conference template (ACL’s template).

Grading

  • Lit Review: These are the informal list of insights you get from related papers you find. (9 pts)
  • Project Description: Make sure you do not only include a description of the project, but also a reason why this project is important (i.e. motivation) (3pts)
  • Related Work: Do not just summarize previous work, but relate it to your project. (3pt)
  • Methods: What have you completed up to now? What do you plan on doing? Write down a concrete plan of action. (3pt)
  • Data: Do not just list the datasets, but describe them, and explain how you will use them. (3pt)
  • Evaluation Plan: What evaluation metrics you chose and why? Are they applicable to your task? (3pt)
  • Attribution: Do not forget to note which team member will do what. (1pt)