Hi! I'm Peter Lee

My research interest includes Data Infrastructure, Data Science, Backend, and IoT. Kubernetes with Istio. DevOps. With passion on open source, I like to share my project with the community and attend a related conference. Now, I contributed to TensorFlow and other projects.

Work

Vpon Taiwan

Data Engineer (Architect)

2018 Oct -> Present

Taipei, Taiwan

Website: Vpon

In data team: I design new system infra with a new stack. I also help our member to solve their daily problems.

1: Data migration: I help the data team to transfer existing projects from AWS to GCP. Example: Migrate the ETL pipeline from AWS to GCP. Moving our Gitlab from AWS to GCP (Gitlab CI/CD). Transfer data from Hive and S3 to GCP

2: Infrastructure: We use Google Cloud Platform and Amazon Web Service. I designed the architecture of our system. Like the Gitlab CI/CD, ETL flow, Data processing pipeline. For example GKE(Google Kubernetes Engine) with Istio, BigQuery, Cloud Function, and GCE.

3: CI/CD: Design CI/CD pipeline for our project. Our pipeline Build Docker Image, do Unit Test, then deploy to Container Registry. And these image will deploy on our Kubernetes cluster

4: Data tagging service: A high customize tagging service, it processes our data with many kinds of the tag. It provides a view of people interests, language, age, gender and so on

5: ETL pipeline: Our daily cronjob to process many kinds of data. Like the data exchange with our partners. I deployed jobs on Apache Airflow.

6: Elastic Stack: Processing millions record data every day. It can validate data and watch our report in real-time.

PDIS (Public Digital Innovation Space)

Software Engineer in (Alternative Military Service)

2017 Sep -> 2018 Aug

Executive Yuan 行政院, Taipei, Taiwan

Website: PDIS

PDIS is a government organization in Taiwan, it also has another name: Executive Yuan - Ministry of Digital - Audrey Tang's 唐鳳 Office. She leads the PDIS team to help our government. We incubate and facilitate public digital innovation for government.

1: As a software developer: I use Line BOT API to connect our internal systems to make an easy-to-use app to help our colleagues saving their time. I also designed API interfaces to allow other internal systems to hook up with our bot system.Up to now, it already bridged across our meeting reserve system and electronic bulletin board system.

2: About productive and quality: I designed a software that can display real-time subtitle during streaming. To meet the schedule, I took only one week to make it from idea to prototype. This system now works perfectly in all of our video conferences.

3: As a quick learner and contributor: I involved in many open source projects. Like Sandstorm(A open source private cloud with the container), Rocket.Chat(A open source chat app which like Slack). We have fixed many issues and add lots of features, and giving back to the community.

4: Eager to learn everything: when I have free time, I like to think about how to make the system more efficient. I still find more possibility to improve the current system. Like the subtitle display system, I would like to let some part of the system can automatically work. Let the computer can label the people with their name in real-time and display the information on the subtitle. On the road of development, I fixed various issues and bugs of Tensorflow. NVIDIA give me an award as the outstanding community developer in this April.

91app (Nexdoor)

iOS Engineer (Intern)

2013 Jul -> 2014 Sep

Taipei, Taiwan

Website: Chinese(TW)

Nexdoor is a famous App company in 2013, we build iOS/Android/Web for the enterprise. I was an iOS Engineer Intern when I was an undergraduate student. I developed some interesting project, like Geofenece App, LBS Service framework and customize UI. In 2016, most of the employee join new company 91 app.

Publications

Implementation of Lambda Architecture: A Restaurant Recommender System over Apache Mesos

Advanced Information Networking and Applications (AINA)

2017 IEEE 31st International Conference

A Lambda Architecture at DC/OS, using Spark, Spark Streaming, Kafka, Hadoop HDFS.

Stock market analysis from Twitter and news based on streaming big data infrastructure

Awareness Science and Technology (iCAST)

2017 IEEE 8th International Conference

Real-time trend analysis by twitter, using Spark, Spark Streaming, Kafka, Classification Data and Visualize on Web.

Real-time Trend Analysis of Streaming Twitter and News Based on Big Data Infrastructure

電子情報通信学会-講演論文

信学技報, vol. 117, no. 184, SC2017-13, pp. 1-6, 2017年8月.

SC2017-13 2017-08-18 (SWIM, SC)

Selected Projects

PDIS Translate

Bot for getting files and receipt for Gengo translate platform.

RealTimeSubtitle

A open-source plug-in to display subtitle on OBS Studio

Airbox

A open-source pm 2.5 sensor

Serverless, using ELK to store and Visualize data.

Github

Awards

NVIDIA Jetson Community Developer

2018 April - Outstanding-jetson-developer-community-contributions, Porting TensorFlow for NVIDIA Jetson.

NVIDIA Developer Forum

TensorFlow Contributor

Porting TensorFlow to NVIDIA Jetson

Repository tensorflow-nvJetson

TensorFlow #26985, #20025, #19075, #17394

Selected Skills

Data


Kubernetes

Elasticsearch

Spark

Kafka

Airflow

Backend/Web


Node.js

Mongodb

Webpack

Sass

General


Python

Git

Google Cloud platform

Jenkins

Docker

iOS(Swift)

Education

University Of Aizu, Fukushima, Japan.

2016 March - 2017 March

Master Degree of Computer Science.

Double Degree Program.

Tamkang University, Taipei, Taiwan.

2011-2014

Bachelor of Engineering (B.E.) Computer Science.

Embeded System Lab


2014-2017

Master Degree of Computer Science.

Cloud Computing Lab