Data
Science

Spring 2020
course
site

Jan 31

Topics for this week : visualization, vectors, matrices Please explore chapters 3 & 4 in the text.

visualize

Chapter 3 shows you how to make some basic plots such as histograms and scatterplots in jupyter.

(Be aware that matplotlib - one of the standard ways to make these plots in this environment - has its own idiosyncrasies. Most of the functions change a global default "figure" object, and it may draw (or not draw) a plot depending on what it expects you want. Trying to initiate or show a plot from within a function may not behave the way you expect. Check out its docs - I've put links on the resources page - and follow their examples.)

manipulate

Chapter 4 discusses briefly vectors (1D arrays of numbers) and matrices (2D arrays of numbers), and some ways to combine and operate on them.

I'll walk through some of the code from the book for both chapters and show you another file of graphics I wrote last year, using the attached files.

your mission ...

Your job for this week:

We'll play around in class with these ideas using the iris flower dataset.

Think of this as getting your feet wet for the project(s) you'll dive into more deeply later in the term.

Here are a few datasets you might consider. See the resources page for more options, or search online with terms like "dataset" and a topic you're interested in.

https://cs.marlboro.college /cours /spring2020 /data /notes /jan31
last modified Thu March 28 2024 7:43 pm

attachments [paper clip]

  last modified size
TXT 3_visualizing.html Thu Mar 28 2024 07:43 pm 555K
TXT 3_visualizing.ipynb Thu Mar 28 2024 07:43 pm 263K
TXT 4_vectors.html Thu Mar 28 2024 07:43 pm 280K
TXT 4_vectors.ipynb Thu Mar 28 2024 07:43 pm 5.8K
TXT fib_fit.html Thu Mar 28 2024 07:43 pm 537K
TXT fib_fit.ipynb Thu Mar 28 2024 07:43 pm 248K