sf.
Blogs
杂感
About
Blogs
About

Tags

  • ALL
  • big data
  • cloud
  • data mining
  • database
  • deep learning
  • life
  • machine learning
  • nlp
  • python
  • remote sensing
    • Understanding Deep Residual Networks
      Deep residual networks (ResNet) took the deep learning world by storm when Microsoft Research released Deep Residual Learning for Image Recognition. These networks led to 1st-place winning entries ...
      2018-11-02
      @Shuzhan Fan
      deep learning   
    • GCP Products Overview
      Introduction Cloud Compute Engine gcloud GCP Storage Solutions IAM Cloud BigQuery Could Datalab Cloud Datastudio Cloud Pub/Sub Cloud Dataproc Cloud Dataflow Stackdriver Machine Learni...
      2018-10-12
      @Shuzhan Fan
      cloud   
    • Understanding Ensemble machine learning methods
      Ensemble methods in machine learning is a class of method which combines multiple machine learning models into one predictive model in order to have a boost on the prediction accuracy over a single...
      2018-09-19
      @Shuzhan Fan
      machine learning   
    • Understanding Word2Vec and Doc2Vec
      Word embeddings are a type of word representation which stores the contextual information in a low-dimensional vector. This approach gained extreme popularity with the introduction of Word2Vec in 2...
      2018-08-24
      @Shuzhan Fan
      machine learning    nlp   
    • Understanding the mathematics behind Linear Discriminant Analysis (LDA)
      We are already familiar with Logistic Regression classification algorithm. It works fine for two-class classification problems. However, if there are more than two classes, Logistic Regression will...
      2018-07-23
      @Shuzhan Fan
      machine learning   
    • Understanding the mathematics behind linear regression
      Today we are going to talk about linear regression, one of the most well known and well understood algorithms in machine learning. We are going to focus on the simple linear regression, which conta...
      2018-07-02
      @Shuzhan Fan
      machine learning   
    • Understanding the mathematics behind Naive Bayes
      Naive Bayes, or called Naive Bayes classifier, is a classifier based on Bayes Theorem with the naive assumption that features are independent of each other. Without further ado, let’s get straight ...
      2018-06-08
      @Shuzhan Fan
      machine learning   
    • Landsat 8 data access from Google Cloud Storage
      Landsat 8 bandsLandsat 8 is one of NASA’s EOS (Earth Observing System) satellites. It was launched in February 2013 and was aimed to replace its predecessor Landsat 7. The satellite collects images...
      2018-05-18
      @Shuzhan Fan
      remote sensing    cloud   
    • Understanding the mathematics behind Support Vector Machines
      Support Vector Machine (SVM) is one of the most powerful out-of-the-box supervised machine learning algorithms. Unlike many other machine learning algorithms such as neural networks, you don’t have...
      2018-05-07
      @Shuzhan Fan
      machine learning   
    • A data visualization of the impact of tariff spat on midterm election
      The tarrif spat between US and China is on the brink of escalating to a full-blown trade war, which would be a disaster for the global economy. On Tuesday, April 3rd, the Trump administration propo...
      2018-04-09
      @Shuzhan Fan
      life    data mining   
    • Geo-tagged tweets collection using Twitter Streaming API and database
      One research I’m working on is to use Twitter data to predict crime patterns. So, the first thing I need to do is to collect Twitter data. Specifically, since I’m interested in discovering the spat...
      2018-03-20
      @Shuzhan Fan
      big data    nlp    python   
    • Machine Learning Classification Model Evaluation Metrics
      After training the machine learning classification model, we should always evaluate the model to determine if it does a good job of predicting the target value on new unseen data. Among the various...
      2018-02-13
      @Shuzhan Fan
      machine learning   
    • Running Jupyter Notebook with Apache Spark on Google Cloud Compute Engine
      Apache Spark is a powerful open-source cluster-computing framework. Compared to Apache Hadoop, especially Hadoop MapReduce, Spark has advantages such as speed, generality, ease of use, and interact...
      2018-01-25
      @Shuzhan Fan
      big data    cloud    python   
    • How to install and set up MySQL on Mac
      MySQL is probably the most popular open source SQL relational database. Unfortunately, MacOS doesn’t ship with MySQL. I still remember when I took my first database class years ago, the professor h...
      2017-12-27
      @Shuzhan Fan
      database   
    • Using Python subprocess for parallel processing
      Unlike Javascript, which is naturally asynchronous, Python interpreter executes codes in a sequential order. The subsequent jobs have to wait until the completeness of the previous ones. This behav...
      2017-12-17
      @Shuzhan Fan
      python   
    • My first blog
      This is my first blog, EVER! I’ve always been thinking of writing something about the work I do, sharing the knowledge I know, and of course, learning new stuff in turn. Now finally I made my decis...
      2017-12-06
      @Shuzhan Fan
      life   

    SHUZHAN FAN © 2017