pytorch分别用MLP和RNN拟合sinx

52次阅读

共计 3105 个字符，预计需要花费 8 分钟才能阅读完成。

0

实践上带有一个非线性函数的网络可能拟合任意函数。那显然 MLP 和 RNN 是科研拟合 sinx 的。
结尾先把后果给展现进去，而后是代码，最初是我的过程。懒得看的间接看前半部分行了，过程给有趣味的人看看。

先上后果图

注：每次训练 torch 初始化有不同，所以后果有出入。

代码

乍一看挺多的，实际上简略得一批。只不过是定义了两个网络，训练了两次，展现图片的反复代码而已。具体代码曾经正文。

import torch
import math
import matplotlib.pyplot as plt
class MLP(torch.nn.Module):
 def __init__(self):
 super().__init__()
 self.layer1=torch.nn.Linear(1,16)
 self.layer2=torch.nn.Linear(16,16)
 self.layer3=torch.nn.Linear(16,1)
 def forward(self,x):
 x=self.layer1(x)
 x=torch.nn.functional.relu(x)
 x=self.layer2(x)
 x=torch.nn.functional.relu(x)
 x=self.layer3(x)
 return x
# rnn takes 3d input while mlp only takes 2d input
class RecNN(torch.nn.Module):
 def __init__(self):
 super().__init__()
 self.rnn=torch.nn.LSTM(input_size=1,hidden_size=2,num_layers=1,batch_first=True)
 #至于这个线性层为什么是 2 维度接管，要看最初网络输入的维度是否匹配 label 的维度 
 self.linear=torch.nn.Linear(2,1)
 def forward(self,x):
 # print("x shape: {}".format(x.shape))
 # x [batch_size, seq_len, input_size]
 output,hn=self.rnn(x)
 # print("output shape: {}".format(output.shape))
 # out [seq_len, batch_size, hidden_size]
 x=output.reshape(-1,2)
 # print("after change shape: {}".format(x.shape))
 x=self.linear(x)
 # print("after linear shape: {}".format(x.shape))
 return x
def PlotCurve(mlp, rnn, input_x, x):
 # input_x 是输出网络的 x。
 # sin_x 是列表，x 的取值，一维数据、
 # 尽管他们的内容（不是维度）是一样的。能够 print shape 看一下。
 mlp_eval = mlp.eval()
 rnn_eval = rnn.eval()
 mlp_y = mlp_eval(input_x)
 rnn_y = rnn_eval(input_x.unsqueeze(0))
 plt.figure(figsize=(6, 8))
 plt.subplot(211)
 plt.plot([i + 1 for i in range(EPOCH)], mlp_loss, label='MLP')
 plt.plot([i + 1 for i in range(EPOCH)], rnn_loss, label='RNN')
 plt.title('loss')
 plt.legend()
 plt.subplot(212)
 plt.plot(x, torch.sin(x), label="original", linewidth=3)
 plt.plot(x, [y[0] for y in mlp_y], label='MLP')
 plt.plot(x, [y[0] for y in rnn_y], label='RNN')
 plt.title('evaluation')
 plt.legend()
 plt.tight_layout()
 plt.show()
# 常量都取出来，以便改变 
EPOCH=1000
RNN_LR=0.01
MLP_LR=0.001
left,right=-10,10
PI=math.pi
if __name__ == '__main__':
 mlp=MLP()
 rnn=RecNN()
 # x,y 是一般 sinx 的 torch tensor
 x = torch.tensor([num * PI 4 for num in range(left, right)])
 y = torch.sin(x)
 # input_x 和 labels 是训练网络时候用的输出和标签。
 input_x=x.reshape(-1, 1)
 labels=y.reshape(-1,1)
 #训练 mlp
 mlp_optimizer=torch.optim.Adam(mlp.parameters(), lr=MLP_LR)
 mlp_loss=[]
 for epoch in range(EPOCH):
 preds=mlp(input_x)
 loss=torch.nn.functional.mse_loss(preds,labels)
 mlp_optimizer.zero_grad()
 loss.backward()
 mlp_optimizer.step()
 mlp_loss.append(loss.item())
 #训练 rnn
 rnn_optimizer=torch.optim.Adam(rnn.parameters(),lr=RNN_LR)
 rnn_loss=[]
 for epoch in range(EPOCH):
 preds=rnn(input_x.unsqueeze(0))
 # print(x.unsqueeze(0).shape)
 # print(preds.shape)
 # print(labels.shape)
 loss=torch.nn.functional.mse_loss(preds,labels)
 rnn_optimizer.zero_grad()
 loss.backward()
 rnn_optimizer.step()
 rnn_loss.append(loss.item())
 PlotCurve(mlp, rnn, input_x, x)
 一些留神的点（过程）
 有些人的代码是多加了 dalaloader 来做了数据集的 loader，我集体认为没啥必要，这么简略的货色。当然加了 loader 或者更加合乎习惯。
 为什么数据只取了 20 个（从 left 到 right 只有 sinx 的 20 个数据）？我一开始是从 -128 左近取到了 128 左近，然而发现训练成果奇差无比，狐疑人生了都。这仅仅取了 20 个数据，都须要 1000 次训练，更大的数据集的工夫代价可见一斑。
RNN 的 lr 是 0.01，MLP 的是 0.001？这个也是依据 loss 的图来调节的，0.001 在我这个 rnn 里并不适宜，训练太慢了。而且为了和 mlp 的 EPOCH 保持一致，就换了 0.01 的学习率。然而为什么 RNN 比 MLP 降落的慢？这个有待进一步探讨（当然是因为我太菜了）。
 对于 loss function，为什么用 mse loss？轻易选的。我又换了 l1_loss 和其余的 loss 试了，成果差不多，毕竟这么简略的函数拟合，什么损失函数无所谓了。
 论文指出，RNN 系列网络比 MLP 拟合工夫序列数据能力更强，为什么这次训练反而比 MLP 降落更慢？不仅如此，其实如果屡次比拟 MLP 和 RNN 的拟合成果，发现 MLP 成果更稳固更好一些，这又是为什么呢？有待进一步探讨。

 正文完
   pytorch
   发表至： pytorch 
   2020-07-08

       关于pytorch:pytorch版本yolov3如何训练自己的数据集
 
     关于pytorch:PyTorch模型创建与nnModule
 
     关于pytorch:pytorch前向传播和反向传播
 
     关于pytorch:协方差矩阵在torch和numpy中的比较以及自行实现torch协方差矩阵
 
  
 
    KubeSphere-社区开源负载均衡器-Porter-进入-CNCF-云原生全景图
上一篇 
  react仿百度模糊搜索
下一篇

   Just My Socks（注册教程 内含优惠码）

    
   
 
 
 
 
 
     版权说明 
 本站部分资源来自于网络收集，若侵犯了你的隐私或版权，请及时联系我们删除有关信息。联系邮箱:wukuili@gmail.com
 
 
 
   鲁ICP备18007129号   Theme by Puock