ひよこ、通勤中。

通勤中の電車の中でひよこは何を思うのか。

2018-01-19

いつものHML分析

d = access_features['count'].sort_values().reset_index()
d.columns = ['base_index', 'count']
d = d.reset_index() 

from sklearn.cluster import KMeans
from matplotlib import cm
kmeans = KMeans(n_clusters=3)
kmeans.fit(d[['count']])
d.loc[:, 'cluster'] = kmeans.labels_

fig, ax = plt.subplots()
colors = cm.Accent.colors
for i in range(3):
    target = d[d['cluster'] == i]
    ax.scatter(x=target['index'], y=target['count'], c=colors[i]

f:id:cocodrips:20180119160746p:plain

2018-01-18

pandasで日時周り

dfにtimeカラムがあることを想定

文字列 -> datetime型

timeが2017-01-01とかの文字列だったとき

pd.to_datatime(df['time'])

でだいたいよしなにparseしてくれる

unixtime -> datetime型

timeがunixtimeだったとき

pd.to_datatime(df['time'], unit='s')

一部情報を取り出す(日付、月、時間等)

datetime = pd.to_datatime(df['time'])
datetime.dt.hour

2017-11-20

数百字のテキストを分類/クラスタリングしてみる　with Keras

参考文献メモ。

分類

短文ならLSTM, 長文ならRNNが良いという噂。

クラスタリングするなら

色々

NLPでクラスタリングにAutoEncoderを使う

Autoencoders: Applications in Natural Language Processing

Recursive AutoEncoder

Autoencoders: Recursive Autoencoders

LSTMを理解する

Understanding LSTM Networks -- colah's blog

KerasでLSTM AutoEncoder

2017-10-05

PyCharmでRoot以外にもPythonPathを追加したい時

PyCharm: 2017.1バージョン

Preference > Project: Project Structure > パスを通したいフォルダを選択して「Sources」ボタンをクリック

f:id:cocodrips:20171005121253p:plain

2017-10-03

postgresqlでcsvファイルから一括upsert

全部1行ずつupsertしてたら100万件で数時間かかったので、他の解決策を考える。

1時テーブルを作成しupdate + insert

PostgreSQL CSV 取り込み upsert | odekakeshimasyo.me

copy from => tmp table
tmp tableからUpdate
tmp tableからUpdateした分を削除してInsert

PostgreSQLメモ

Activeで何クエリが走ってるか

select pid, query from pg_stat_activity where state = 'active';

クエリジョブのkill

select pg_cancel_backend(pid);

2017-09-27

pythonの型に関する話

関数アノテーション: https://www.python.org/dev/peps/pep-3107/
変数アノテーション: https://www.python.org/dev/peps/pep-0526/
Type Hinting: https://www.python.org/dev/peps/pep-0484/

2017-09-27

Pythonのdocstringの書き方について

3つの書き方

自分の基本的な書き方

基本的にはreSTの書き方しているつもり。

''' 1行関数・クラス説明

複数行説明...

:param type param-name: param-description
:return: return-description
:rtype: return-type
'''

docstringのtypeの書き方

Type Hinting in PyCharm - Help | PyCharm