Dataset

class CustomDataset(torch.utils.data.Dataset): 
  def __init__(self):
  데이터셋의 전처리를 해주는 부분

  def __len__(self):
  데이터셋의 길이. 즉, 총 샘플의 수를 적어주는 부분

  def __getitem__(self, idx): 
  데이터셋에서 특정 1개의 샘플을 가져오는 함수

Transform

이미지를 변형(transform)을 적용해서 데이터를 조작하고 학습에 적합하게 만든다는 것이 큰 개념이다.

import torchvision.transforms as T

# 이미지 전처리 작업을 정의
preprocess = T.Compose([
    T.Resize((256, 256)), # resize to (256, 256)
    T.RandomHorizontalFlip(), # 
    T.ToTensor(), # normalization with range [0, 1]
    T.Normalize((0.5), (0.5)) # 각 채널별 평균을 뺀뒤 표준편차로 나누어 표준화를 진행, standarazation에 가까움 
])

# 이미지에 전처리 작업 적용
image = Image.open('image.jpg')
imge = preprocess(image)

Grdient

Pytorch의 Gradient란?

LSTM

class Net(nn.Module):
    def __init__(self, input_dim, hidden_dim, seq_len, output_dim, layers):
        super(Net, self).__init__()
        self.hidden_dim = hidden_dim
        self.seq_len = seq_len
        self.output_dim = output_dim
        self.layers = layers

        self.lstm = nn.LSTM(input_size=input_dim, 
                            hidden_size=hidden_dim, 
                            num_layers=layers,
                            batch_first=True)
        
        self.fc = nn.Linear(in_features=hidden_dim, 
                            out_features=output_dim, 
                            bias=True)

    def reset_hidden_state(self, x):
        h_0 = torch.zeros(self.layers, x.size(0), self.hidden_dim)
        c_0 = torch.zeros(self.layers, x.size(0), self.hidden_dim)

        return h_0, c_0

    def forward(self, x):
        h_0, c_0 = self.reset_hidden_state(x)
        output, (h_t, c_t) = self.lstm(x, (h_0, c_0))
        x = self.fc(output[:, -1])
        return x

Parameters

torch.nn.LSTM(input_size, hidden_size, num_layers, bias=True, batch_first=True, dropout, bidirectional)

자세한 설명

input_size:input의 feature dimension을 넣어주어야 한다. time step이 아닌 input feature dimension
hidden_size: 내부에서 어떤 feature dimension으로 바꿔주고 싶은지를 나타냄
num_layers: 재귀 층의 갯수

bias: bias term을 둘 것인가 (Default: True)
batch_first: batch_first=True라면, (Batch_size, Time_step, Input_feature_dimension) 순서 (Default: False)
dropout: Dropout의 비율, generalization 잘 안되면 조정
bidirectional: 양방향 여부 (bidirectional 하면 [forward, backword]가 되어 feature dimension 2배)

Reset State

output, (h_t, c_t) = self.lstm(x, (h_0, c_0))

Input은 입력 sequence와 초기 state (LSTM이면 {hidden state, cell state} 그 외는 {hidden state})로 구성
초기 state가 없다면 넣어주지 않아도 되고 자동으로 zero로 설정 
입력 sequence의 dimension은 (Batch, Time_step, Feature dimension)순 (batch_first=True 기준)

LSTM Cell

def LSTMCell(input, hidden, w_ih, w_hh, b_ih=None, b_hh=None):
    if input.is_cuda:
        ...
    hx, cx = hidden
    gates = F.linear(input, w_ih, b_ih) + F.linear(hx, w_hh, b_hh)
    ingate, forgetgate, cellgate, outgate = gates.chunk(4, 1)
    ingate     = F.sigmoid(ingate)
    forgetgate = F.sigmoid(forgetgate)
    cellgate   = F.tanh(cellgate)
    outgate    = F.sigmoid(outgate)
    cy = (forgetgate * cx) + (ingate * cellgate)
    hy = outgate * F.tanh(cy)
    return hy, cy

1_hl9UVtgIcQkDIGD8VFykdw

Equation

$i_t=\sigma(W_{ii}x_t+b_{ii}+W_{hi}h_{t-1}+b_{hi})$
$f_t=\sigma(W_{if}x_t+b_{if}+W_{hf}h_{t-1}+b_{hf}$
$g_t=\tanh(W_{ig}x_t+b_{ig}+W_{hg}h_{t-1}+b_{hg})$
$o_t=\sigma(W_{io}x_t+b_{io}+W_{ho}h_{t-1}+b_{ho})$
$c_t=f_t\odot c_{t-1}+i_t\odot g_t$
$h_t=o_t\odot\tanh(c_t)$

Parameters

LSTM Cell 파라미터 설명

torch.nn.LSTMCell(input_size, hidden_size, bias=True, device=None, dtype=None)

input_size: 입력으로 주어지는 데이터의 feature의 갯수
hidden_size: hidden state $h$의 벡터 크기

Pytorch - soup1997/Study-Alone GitHub Wiki

Dataset

Transform

Grdient

LSTM

Parameters

Reset State

LSTM Cell

Equation

Parameters

⚠️ GitHub.com Fallback ⚠️

Pytorch - soup1997/Study-Alone GitHub Wiki

Dataset

Transform

Grdient

LSTM

Parameters

Reset State

LSTM Cell

Equation

Parameters

⚠️ **GitHub.com Fallback** ⚠️

⚠️ GitHub.com Fallback ⚠️