Data Science

Spring 2020

Jan 31

Topics for this week: visualization, vectors, and matrices. Please explore chapters 3 & 4 in the text.

visualize

Chapter 3 shows you how to make some basic plots, such as histograms and scatterplots, in Jupyter.

(Be aware that matplotlib - one of the standard ways to make these plots in this environment - has its own idiosyncrasies. Most of its pyplot functions act on an implicit global "current figure" object, and it may or may not draw a plot depending on what it guesses you want. Creating or showing a plot from within a function may not behave the way you expect. Check out its docs - I've put links on the resources page - and follow their examples.)
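One way to sidestep the global-figure surprises is to ask matplotlib for explicit Figure and Axes objects and call the plotting methods on those. The following is only a sketch of that approach, not code from the book: the function name make_plots and the random data are made up for illustration.

```python
import random

import matplotlib.pyplot as plt


def make_plots(values, xs, ys):
    """Draw a histogram and a scatterplot on explicit Axes; return the Figure.

    make_plots is a hypothetical helper name, used only for this sketch.
    """
    # plt.subplots hands back the Figure and its Axes, so the calls below
    # do not depend on whichever figure pyplot considers "current".
    fig, (ax_hist, ax_scatter) = plt.subplots(1, 2, figsize=(8, 3))

    ax_hist.hist(values, bins=10)
    ax_hist.set_title("histogram")
    ax_hist.set_xlabel("value")
    ax_hist.set_ylabel("count")

    ax_scatter.scatter(xs, ys)
    ax_scatter.set_title("scatterplot")
    ax_scatter.set_xlabel("x")
    ax_scatter.set_ylabel("y")

    fig.tight_layout()
    return fig


# Made-up data, just so there is something to draw.
random.seed(0)
values = [random.gauss(0, 1) for _ in range(200)]
xs = [random.random() for _ in range(50)]
ys = [2 * x + random.gauss(0, 0.1) for x in xs]

fig = make_plots(values, xs, ys)
```

In a notebook with inline plotting, the figure is normally drawn when the cell finishes. In a plain script you would follow this with plt.show() or fig.savefig("plots.png").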

manipulate

Chapter 4 briefly discusses vectors (1D arrays of numbers) and matrices (2D arrays of numbers), and some ways to combine and operate on them.
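To make "combine and operate on them" concrete, here is a minimal sketch that treats a vector as a list of numbers and a matrix as a list of rows. The function names are illustrative choices and may not match the book's; a library such as NumPy would do the same work faster.

```python
from typing import List, Tuple

Vector = List[float]
Matrix = List[List[float]]  # a list of rows, each row a Vector


def add(v: Vector, w: Vector) -> Vector:
    """Add two vectors component by component."""
    assert len(v) == len(w), "vectors must be the same length"
    return [v_i + w_i for v_i, w_i in zip(v, w)]


def scalar_multiply(c: float, v: Vector) -> Vector:
    """Multiply every component of v by the number c."""
    return [c * v_i for v_i in v]


def dot(v: Vector, w: Vector) -> float:
    """Sum of the products of matching components."""
    assert len(v) == len(w), "vectors must be the same length"
    return sum(v_i * w_i for v_i, w_i in zip(v, w))


def shape(A: Matrix) -> Tuple[int, int]:
    """Return (number of rows, number of columns)."""
    n_rows = len(A)
    n_cols = len(A[0]) if A else 0
    return n_rows, n_cols


def matrix_times_vector(A: Matrix, v: Vector) -> Vector:
    """Each output component is the dot product of one row of A with v."""
    return [dot(row, v) for row in A]


v = [1, 2, 3]
w = [4, 5, 6]
A = [[1, 0, 0],
     [0, 2, 0]]

print(add(v, w))                  # [5, 7, 9]
print(scalar_multiply(2, v))      # [2, 4, 6]
print(dot(v, w))                  # 32
print(shape(A))                   # (2, 3)
print(matrix_times_vector(A, v))  # [1, 4]
```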

I'll walk through some of the code from the book for both chapters and show you another file of graphics I wrote last year, using the attached files.

your mission ...

Your job for this week:

We'll play around in class with these ideas using the iris flower dataset.

Think of this as getting your feet wet for the project(s) you'll dive into more deeply later in the term.

Here are a few datasets you might consider. See the resources page for more options, or search online with terms like "dataset" and a topic you're interested in.

https://cs.marlboro.college/cours/spring2020/data/notes/jan31
last modified Mon July 13 2020 5:46 am

attachments

file                 last modified             size
3_visualizing.html   Mon Jul 13 2020 05:46 am  555K
3_visualizing.ipynb  Mon Jul 13 2020 05:46 am  263K
4_vectors.html       Mon Jul 13 2020 05:46 am  280K
4_vectors.ipynb      Mon Jul 13 2020 05:46 am  5.8K
fib_fit.html         Mon Jul 13 2020 05:46 am  537K
fib_fit.ipynb        Mon Jul 13 2020 05:46 am  248K