国产精品XXX视频,www.色呦呦,国产黄色免费观看

新聞中心

這里有您想知道的互聯(lián)網(wǎng)營(yíng)銷解決方案

利用TensorFlow和神經(jīng)網(wǎng)絡(luò)來(lái)處理文本分類問(wèn)題

在這篇文章中，作者討論了六個(gè)關(guān)于創(chuàng)建機(jī)器學(xué)習(xí)模型來(lái)進(jìn)行文本分類的主要話題。

讓客戶滿意是我們工作的目標(biāo)，不斷超越客戶的期望值來(lái)自于我們對(duì)這個(gè)行業(yè)的熱愛(ài)。我們立志把好的技術(shù)通過(guò)有效、簡(jiǎn)單的方式提供給客戶，將通過(guò)不懈努力成為客戶在信息化領(lǐng)域值得信任、有價(jià)值的長(zhǎng)期合作伙伴，公司提供的服務(wù)項(xiàng)目有：域名注冊(cè)、虛擬空間、營(yíng)銷軟件、網(wǎng)站建設(shè)、吳江網(wǎng)站維護(hù)、網(wǎng)站推廣。

TensorFlow 如何工作
機(jī)器學(xué)習(xí)模型是什么
神經(jīng)網(wǎng)絡(luò)是什么
神經(jīng)網(wǎng)絡(luò)怎樣進(jìn)行學(xué)習(xí)
如何處理數(shù)據(jù)并且把它們傳輸給神經(jīng)網(wǎng)絡(luò)的輸入
怎樣運(yùn)行模型并且得到預(yù)測(cè)結(jié)果

作者也提供了可在Jupyter notebook上運(yùn)行的代碼。我將回顧這六個(gè)話題并且與我自己的經(jīng)驗(yàn)相結(jié)合。

1. TensorFlow 概覽

TensorFlow 是***的開源 AI 庫(kù)之一。它的高計(jì)算效率，豐富的開發(fā)資源使它被企業(yè)和個(gè)人開發(fā)者廣泛采用。在我看來(lái)，學(xué)習(xí) TensorFlow 的***的方法就是使用它的官網(wǎng)教程(https://www.tensorflow.org/)。在這個(gè)網(wǎng)站上，你可以瀏覽「getting started」教程。

我首先將會(huì)對(duì) TensorFlow 的基本定義和主要特征進(jìn)行介紹。張量(Tensor)是一種數(shù)據(jù)結(jié)構(gòu)，它可以把原始值形成任意的多維數(shù)組【1】。張量的級(jí)別就是它的維度數(shù)。這里，我建議閱讀 Python 的應(yīng)用編程接口 API，因?yàn)樗鼘?duì) TensorFlow 的初學(xué)者來(lái)說(shuō)是很友好的。你可以安裝 TensorFlow 并且配置環(huán)境，緊隨官方網(wǎng)站上的指導(dǎo)就可以了。測(cè)試你是否成功安裝 TensorFlow 的方法就是導(dǎo)入(import)TensorFlow 庫(kù)。在 TensorFlow 中，計(jì)算圖(computational graph)是核心部件。數(shù)據(jù)流程圖形用來(lái)代表計(jì)算過(guò)程。在圖形下，操作(Operation)代表計(jì)算單位，張量代表數(shù)據(jù)單位。為了運(yùn)行代碼，我們應(yīng)該對(duì)階段函數(shù)(Session function)進(jìn)行初始化。這里是執(zhí)行求和操作的完整代碼。

 
 
 
 
  
  
  
  #import the library 
  
  
  
  import tensorflow as tf 
  
  
  
  #build the graph and name as my_graph 
  
  
  
  my_graph = tf.Graph() 
  
  
  
  #tf.Session encapsulate the environment for my_graph 
  
  
  
  with my_graph.as_default(): 
  
  
  
     x = tf.constant([1,3,6])  
  
  
  
     y = tf.constant([1,1,1]) 
  
  
  
     #add function 
  
  
  
     op = tf.add(x,y) 
  
  
  
     #run it by fetches 
  
  
  
     result = sess.run(fetches=op) 
  
  
  
     #print it 
  
  
  
     print(result)

你可以看見(jiàn)在 TensorFlow 中編譯是遵循一種模式的，并且很容易被記住。你將會(huì)導(dǎo)入庫(kù)，創(chuàng)建恒定張量(constant tensors)并且創(chuàng)建圖形。然后我們應(yīng)該定義哪一個(gè)圖將會(huì)被在 Session 中使用，并且定義操作單元。最終你可以在 Session 中使用 run() 的方法，并且評(píng)估其中參數(shù)獲取的每一個(gè)張量。

2. 預(yù)測(cè)模型

預(yù)測(cè)模型可以很簡(jiǎn)單。它把機(jī)器學(xué)習(xí)算法和數(shù)據(jù)集相結(jié)合。創(chuàng)建一個(gè)模型的過(guò)程程如下圖所示：

我們首先應(yīng)該找到正確的數(shù)據(jù)作為輸入，并且使用一些數(shù)據(jù)處理函數(shù)來(lái)處理數(shù)據(jù)。然后，這些數(shù)據(jù)就可以與機(jī)器學(xué)習(xí)算法結(jié)合來(lái)創(chuàng)建模型了。在你得到模型后，你可以把模型當(dāng)做一個(gè)預(yù)測(cè)器并且輸入需要的數(shù)據(jù)來(lái)預(yù)測(cè)，從而產(chǎn)生結(jié)果。整個(gè)進(jìn)程如下圖所示：

在本文中，輸入是文本，輸出結(jié)果是類別(category)。這種機(jī)器學(xué)習(xí)算法叫做監(jiān)督學(xué)習(xí)，訓(xùn)練數(shù)據(jù)集是已標(biāo)注過(guò)種類的文本。這也是分類任務(wù)，而且是應(yīng)用神經(jīng)網(wǎng)絡(luò)來(lái)進(jìn)行模型創(chuàng)建的。

3. 神經(jīng)網(wǎng)絡(luò)

神經(jīng)網(wǎng)絡(luò)的主要特征是自學(xué)(self-learning)，而不是進(jìn)行明確地程序化。它的靈感來(lái)源于人類中樞神經(jīng)系統(tǒng)。***個(gè)神經(jīng)網(wǎng)絡(luò)算法是感知機(jī)(Perceptron)。

為了理解神經(jīng)網(wǎng)絡(luò)的工作機(jī)制，作者用 TensorFlow 創(chuàng)建了一個(gè)神經(jīng)網(wǎng)絡(luò)結(jié)構(gòu)。

(1) 神經(jīng)網(wǎng)絡(luò)結(jié)構(gòu)

這里作者使用了兩個(gè)隱蔽層(hidden layers)，每一個(gè)隱蔽層的職責(zé)是把輸入轉(zhuǎn)換成輸出層可以使用的東西【1】。***個(gè)隱蔽層的節(jié)點(diǎn)的數(shù)量應(yīng)該被定義。這些節(jié)點(diǎn)叫做神經(jīng)元，和權(quán)值相乘。訓(xùn)練階段是為了對(duì)這些值進(jìn)行調(diào)節(jié)，為了產(chǎn)生一個(gè)正確的輸出。網(wǎng)絡(luò)也引入了偏差(bias)，這就可以讓你向左或向右移動(dòng)激活函數(shù)，從而讓預(yù)測(cè)結(jié)果更加準(zhǔn)確【2】。數(shù)據(jù)還會(huì)經(jīng)過(guò)一個(gè)定義每個(gè)神經(jīng)元最終輸出的激活函數(shù)。這里，作者使用的是修正線性單元(ReLU)，可以增加非線性。這個(gè)函數(shù)被定義為：

 
 
 
 
  
  
  
  f(x) = max(0,x)（輸出是 x 或 0，無(wú)論 x 多大）

對(duì)第二個(gè)隱蔽層來(lái)說(shuō)，輸入就是***層，函數(shù)與***個(gè)隱蔽層相同。

對(duì)于輸出層，作者使用的是 one-hot 編碼來(lái)得到結(jié)果。在 one-hot 編碼中，除了其中的一位值為 1 以外，所有的位元(bits)都會(huì)得到一個(gè) 0 值。這里使用三種類別作為范例，如下圖所示。

我們可以發(fā)現(xiàn)輸出節(jié)點(diǎn)的數(shù)量值就是類別的數(shù)量值。如果我們想要?jiǎng)澐植煌念悇e，我們可以使用 Softmax 函數(shù)來(lái)使每一個(gè)單元的輸出轉(zhuǎn)化成 0 到 1 間的值，并且使所有單元的總和為 1。它將會(huì)告訴我們每種類別的概率是多少。

上述過(guò)程由下列代碼實(shí)現(xiàn)：

 
 
 
 
  
  
  
  # Network Parameters 
  
  
  
  n_hidden_1 = 10        # 1st layer number of features 
  
  
  
  n_hidden_2 = 5         # 2nd layer number of features 
  
  
  
  n_input = total_words  # Words in vocab 
  
  
  
  n_classes = 3          # Categories: graphics, space and baseball

 
 
 
 
  
  
  
  def multilayer_perceptron(input_tensor, weights, biases): 
  
  
  
      layer_1_multiplication = tf.matmul(input_tensor, weights['h1']) 
  
  
  
      layer_1_addition = tf.add(layer_1_multiplication, biases['b1']) 
  
  
  
      layer_1_activation = tf.nn.relu(layer_1_addition)

 
 
 
 
  
  
  
  # Hidden layer with RELU activation 
  
  
  
      layer_2_multiplication = tf.matmul(layer_1_activation, weights['h2']) 
  
  
  
      layer_2_addition = tf.add(layer_2_multiplication, biases['b2']) 
  
  
  
      layer_2_activation = tf.nn.relu(layer_2_addition)

 
 
 
 
  
  
  
  # Output layer with linear activation 
  
  
  
      out_layer_multiplication = tf.matmul(layer_2_activation, weights['out']) 
  
  
  
      out_layer_addition = out_layer_multiplication + biases['out']

 
 
 
 
  
  
  
  return out_layer_addition

在這里，它調(diào)用了 matmul()函數(shù)來(lái)實(shí)現(xiàn)矩陣之間的乘法函數(shù)，并調(diào)用 add()函數(shù)將偏差添加到函數(shù)中。

4. 神經(jīng)網(wǎng)絡(luò)是如何訓(xùn)練的

我們可以看到其中要點(diǎn)是構(gòu)建一個(gè)合理的結(jié)構(gòu)，并優(yōu)化網(wǎng)絡(luò)權(quán)重的預(yù)測(cè)。接下來(lái)我們需要訓(xùn)練 TensorFlow 中的神經(jīng)網(wǎng)絡(luò)。在 TensorFlow 中，我們使用 Variable 來(lái)存儲(chǔ)權(quán)重和偏差。在這里，我們應(yīng)該將輸出值與預(yù)期值進(jìn)行比較，并指導(dǎo)函數(shù)獲得最小損失結(jié)果。有很多方法來(lái)計(jì)算損失函數(shù)，由于它是一個(gè)分類任務(wù)，所以我們應(yīng)該使用交叉熵誤差。此前 D. McCaffrey[3] 分析并認(rèn)為交叉熵可以避免訓(xùn)練停滯不前。我們?cè)谶@里通過(guò)調(diào)用函數(shù) tf.nn.softmax_cross_entropy_with_logits() 來(lái)使用交叉熵誤差，我們還將通過(guò)調(diào)用 function: tf.reduced_mean() 來(lái)計(jì)算誤差。

 
 
 
 
  
  
  
  # Construct model 
  
  
  
  prediction = multilayer_perceptron(input_tensor, weights, biases)

 
 
 
 
  
  
  
  # Define loss 
  
  
  
  entropy_loss = tf.nn.softmax_cross_entropy_with_logits(logits=prediction, labels=output_tensor) 
  
  
  
  loss = tf.reduce_mean(entropy_loss)

我們應(yīng)該找到***值來(lái)使輸出誤差最小化。這里我們使用隨機(jī)梯度下降(SGD)的方法：

通過(guò)多次迭代，我們將會(huì)得到接近于全局最小損失的權(quán)值。學(xué)習(xí)速率不應(yīng)該太大。自適應(yīng)瞬間評(píng)估函數(shù)(Adaptive Moment Estimation function)經(jīng)常用于計(jì)算梯度下降。在這個(gè)優(yōu)化算法中，對(duì)梯度和梯度的二階矩量進(jìn)行平滑處理【4】。

代碼如下所示，在其它項(xiàng)目中，學(xué)習(xí)速率可以是動(dòng)態(tài)的，從而使訓(xùn)練過(guò)程更加迅速。

 
 
 
 
  
  
  
  learning_rate = 0.001 
  
  
  
  # Construct model 
  
  
  
  prediction = multilayer_perceptron(input_tensor, weights, biases)

 
 
 
 
  
  
  
  # Define loss 
  
  
  
  entropy_loss = tf.nn.softmax_cross_entropy_with_logits(logits=prediction, labels=output_tensor) 
  
  
  
  loss = tf.reduce_mean(entropy_loss)

 
 
 
 
  
  
  
  optimizer = tf.train.AdamOptimizer(learning_ratelearning_rate=learning_rate).minimize(loss)

5. 數(shù)據(jù)操作

這一部分對(duì)于分類成功也很重要。機(jī)器學(xué)習(xí)的開發(fā)者們需要更加在意數(shù)據(jù)，這會(huì)為你節(jié)省大量時(shí)間，并讓結(jié)果更加準(zhǔn)確，因?yàn)檫@可以讓你無(wú)需從頭開始更改配置。在這里，筆者需要指出兩個(gè)重點(diǎn)。首先，為每個(gè)單詞創(chuàng)建一個(gè)索引;然后為每個(gè)文本創(chuàng)建一個(gè)矩陣，如果單詞在文本中，則值為 1，否則為 0。以下代碼可以幫助你理解這個(gè)過(guò)程：

 
 
 
 
  
  
  
  import numpy as np    #numpy is a package for scientific computing 
  
  
  
  from collections import Counter

 
 
 
 
  
  
  
  vocab = Counter() 
  
  
  
  text = "Hi from Brazil" 
  
  
  
  #Get all words 
  
  
  
  for word in text.split(' '): 
  
  
  
      vocab[word]+=1 
  
  
  
           
  
  
  
  #Convert words to indexes 
  
  
  
  def get_word_2_index(vocab): 
  
  
  
      word2index = {} 
  
  
  
      for i,word in enumerate(vocab): 
  
  
  
          word2index[word] = i 
  
  
  
           
  
  
  
      return word2index 
  
  
  
  #Now we have an index 
  
  
  
  word2index = get_word_2_index(vocab) 
  
  
  
  total_words = len(vocab) 
  
  
  
  #This is how we create a numpy array (our matrix) 
  
  
  
  matrix = np.zeros((total_words),dtype=float) 
  
  
  
  #Now we fill the values 
  
  
  
  for word in text.split(): 
  
  
  
      matrix[word2index[word]]+= 1 
  
  
  
  print(matrix) 
  
  
  
  >>> [ 1.  1.  1.]

Python 中的 Counter() 是一個(gè)哈希表。當(dāng)輸入是「Hi from Brazil」時(shí)，矩陣是 [1 ,1, 1]。如果輸入不同，比如「Hi」，矩陣會(huì)得到不同的結(jié)果：

 
 
 
 
  
  
  
  matrix = np.zeros((total_words),dtype=float) 
  
  
  
  text = "Hi" 
  
  
  
  for word in text.split(): 
  
  
  
      matrix[word2index[word.lower()]]+= 1 
  
  
  
  print(matrix) 
  
  
  
  >>> [ 1.  0.  0.]

6. 運(yùn)行模型，獲得結(jié)果

在這一部分里，我們將使用 20 Newsgroups 作為數(shù)據(jù)集。它包含有關(guān) 20 種話題的 18,000 篇文章。我們使用 scilit-learn 庫(kù)加載數(shù)據(jù)。在這里作者使用了 3 個(gè)類別：comp.graphics、sci.space 和 rec.sport.baseball。它有兩個(gè)子集，一個(gè)用于訓(xùn)練，一個(gè)用于測(cè)試。下面是加載數(shù)據(jù)集的方式：

 
 
 
 
  
  
  
  from sklearn.datasets import fetch_20newsgroups 
  
  
  
  categories = ["comp.graphics","sci.space","rec.sport.baseb

 
 
 
 
  
  
  
  newsgroups_train = fetch_20newsgroups(subset='train', categoriescategories=categories) 
  
  
  
  newsgroups_test = fetch_20newsgroups(subset='test', categoriescategories=categories)

它遵循通用的模式，非常易于開發(fā)者使用。

在實(shí)驗(yàn)中，epoch 設(shè)定為 10，這意味著會(huì)有 10 次正+反向遍歷整個(gè)數(shù)據(jù)集。在 TensorFlow 中，占位符的作用是用作 Feed 的目標(biāo)，用于傳遞每個(gè)運(yùn)行步驟的數(shù)據(jù)。

 
 
 
 
  
  
  
  n_input = total_words # Words in vocab 
  
  
  
  n_classes = 3         # Categories: graphics, sci.space and baseball

 
 
 
 
  
  
  
  input_tensor = tf.placeholder(tf.float32,[None, n_input],name="input") 
  
  
  
  output_tensor = tf.placeholder(tf.float32,[None, n_classes],name="output")

我們應(yīng)該分批訓(xùn)練數(shù)據(jù)，因?yàn)樵跍y(cè)試模型時(shí)，我們會(huì)用更大的批次來(lái)輸入 dict。調(diào)用 get_batches() 函數(shù)來(lái)獲取具有批處理尺寸的文本數(shù)。接下來(lái)，我們就可以運(yùn)行模型了。

 
 
 
 
  
  
  
  training_epochs = 10 
  
  
  
  # Launch the graph 
  
  
  
  with tf.Session() as sess: 
  
  
  
      sess.run(init) #inits the variables (normal distribution, reme

 
 
 
 
  
  
  
  # Training cycle 
  
  
  
     for epoch in range(training_epochs): 
  
  
  
         avg_cost = 0. 
  
  
  
         total_batch = int(len(newsgroups_train.data)/batch_size) 
  
  
  
         # Loop over all batches 
  
  
  
         for i in range(total_batch): 
  
  
  
             batch_x,batch_y = get_batch(newsgroups_train,i,batch_size) 
  
  
  
             # Run optimization op (backprop) and cost op (to get loss value) 
  
  
  
             c,_ = sess.run([loss,optimizer], feed_dict={input_tensor: batch_x, output_tensor:batch_y})

在這里我們需要構(gòu)建測(cè)試模型，并計(jì)算它的準(zhǔn)確性。

 
 
 
 
  
  
  
  # Test model 
  
  
  
      index_prediction = tf.argmax(prediction, 1) 
  
  
  
      index_correct = tf.argmax(output_tensor, 1) 
  
  
  
      correct_prediction = tf.equal(index_prediction, index_correct)

 
 
 
 
  
  
  
  # Calculate accuracy 
  
  
  
     accuracy = tf.reduce_mean(tf.cast(correct_prediction, "float")) 
  
  
  
     total_test_data = len(newsgroups_test.target) 
  
  
  
     batch_x_test,batch_y_test = get_batch(newsgroups_test,0,total_test_data) 
  
  
  
     print("Accuracy:", accuracy.eval({input_tensor: batch_x_test, output_tensor: batch_y_test}))

然后我們就可以得到結(jié)果：

結(jié)論

本文介紹了如何使用神經(jīng)網(wǎng)絡(luò)和 TensorFlow 來(lái)處理文本分類任務(wù)。它介紹了與實(shí)驗(yàn)有關(guān)的基礎(chǔ)信息，然而，在我自己運(yùn)行的時(shí)候，效果就沒(méi)有作者那么好了。我們或許可以在這個(gè)架構(gòu)的基礎(chǔ)上改進(jìn)一番，在隱藏層中使用 dropout 肯定會(huì)提高準(zhǔn)確性。

在運(yùn)行代碼前，請(qǐng)確認(rèn)你已安裝了***版本的 TensorFlow。有些時(shí)候你可能會(huì)無(wú)法導(dǎo)入 twenty_newsgroups 數(shù)據(jù)集。當(dāng)這種情況發(fā)生時(shí)，請(qǐng)使用以下代碼來(lái)解決問(wèn)題。

 
 
 
 
  
  
  
  # if you didn't download the twenty_newsgroups datasets, it will run with error 
  
  
  
  # this logging can help to solve the error 
  
  
  
  import logging 
  
  
  
  logging.basicConfig()

以下是完整代碼：

 
 
 
 
  
  
  
  import pandas as pd 
  
  
  
  import numpy as np 
  
  
  
  import tensorflow as tf 
  
  
  
  from collections import Counter 
  
  
  
  from sklearn.datasets import fetch_20newsgroups 
  
  
  
  # if you didn't download the twenty_newsgroups datasets, it will run with error 
  
  
  
  # this logging can help to solve the error 
  
  
  
  import logging 
  
  
  
  logging.basicConfig() 
  
  
  
   
  
  
  
  categories = ["comp.graphics","sci.space","rec.sport.baseball"] 
  
  
  
  newsgroups_train = fetch_20newsgroups(subset='train', categoriescategories=categories) 
  
  
  
  newsgroups_test = fetch_20newsgroups(subset='test', categoriescategories=categories) 
  
  
  
   
  
  
  
  print('total texts in train:',len(newsgroups_train.data)) 
  
  
  
  print('total texts in test:',len(newsgroups_test.data)) 
  
  
  
   
  
  
  
  vocab = Counter() 
  
  
  
  for text in newsgroups_train.data: 
  
  
  
     for word in text.split(' '): 
  
  
  
         vocab[word.lower()]+=1 
  
  
  
          
  
  
  
  for text in newsgroups_test.data: 
  
  
  
     for word in text.split(' '): 
  
  
  
         vocab[word.lower()]+=1 
  
  
  
   
  
  
  
   
  
  
  
  total_words = len(vocab) 
  
  
  
  def get_word_2_index(vocab): 
  
  
  
     word2index = {} 
  
  
  
     for i,word in enumerate(vocab): 
  
  
  
         word2index[word.lower()] = i 
  
  
  
          
  
  
  
     return word2index 
  
  
  
   
  
  
  
  word2index = get_word_2_index(vocab) 
  
  
  
   
  
  
  
  def get_batch(df,i,batch_size): 
  
  
  
     batches = [] 
  
  
  
     results = [] 
  
  
  
     texts = df.data[i*batch_size:i*batch_size+batch_size] 
  
  
  
     categories = df.target[i*batch_size:i*batch_size+batch_size] 
  
  
  
     for text in texts: 
  
  
  
         layer = np.zeros(total_words,dtype=float) 
  
  
  
         for word in text.split(' '): 
  
  
  
             layer[word2index[word.lower()]]+= 1 
  
  
  
              
  
  
  
         batches.append(layer) 
  
  
  
          
  
  
  
     for category in categories: 
  
  
  
         y = np.zeros((3),dtype=float) 
  
  
  
         if category == 0: 
  
  
  
             y[0] = 1. 
  
  
  
         elif category == 1: 
  
  
  
             y[1] = 1. 
  
  
  
         else: 
  
  
  
             y[2] = 1. 
  
  
  
         results.append(y) 
  
  
  
              
  
  
  
       
  
  
  
     return np.array(batches),np.array(results) 
  
  
  
   
  
  
  
  # Parameters 
  
  
  
  learning_rate = 0.01 
  
  
  
  training_epochs = 10 
  
  
  
  batch_size = 150 
  
  
  
  display_step = 1 
  
  
  
   
  
  
  
  # Network Parameters 
  
  
  
  n_hidden_1 = 100      # 1st layer number of features 
  
  
  
  n_hidden_2 = 100       # 2nd layer number of features 
  
  
  
  n_input = total_words # Words in vocab 
  
  
  
  n_classes = 3         # Categories: graphics, sci.space and baseball 
  
  
  
   
  
  
  
  input_tensor = tf.placeholder(tf.float32,[None, n_input],name="input") 
  
  
  
  output_tensor = tf.placeholder(tf.float32,[None, n_classes],name="output")  
  
  
  
   
  
  
  
  def multilayer_perceptron(input_tensor, weights, biases): 
  
  
  
     layer_1_multiplication = tf.matmul(input_tensor, weights['h1']) 
  
  
  
     layer_1_addition = tf.add(layer_1_multiplication, biases['b1']) 
  
  
  
     layer_1 = tf.nn.relu(layer_1_addition) 
  
  
  
      
  
  
  
     # Hidden layer with RELU activation 
  
  
  
     layer_2_multiplication = tf.matmul(layer_1, weights['h2']) 
  
  
  
     layer_2_addition = tf.add(layer_2_multiplication, biases['b2']) 
  
  
  
     layer_2 = tf.nn.relu(layer_2_addition) 
  
  
  
      
  
  
  
     # Output layer  
  
  
  
     out_layer_multiplication = tf.matmul(layer_2, weights['out']) 
  
  
  
     out_layer_addition = out_layer_multiplication + biases['out'] 
  
  
  
      
  
  
  
     return out_layer_addition 
  
  
  
   
  
  
  
  # Store layers weight & bias 
  
  
  
  weights = { 
  
  
  
     'h1': tf.Variable(tf.random_normal([n_input, n_hidden_1])), 
  
  
  
     'h2': tf.Variable(tf.random_normal([n_hidden_1, n_hidden_2])), 
  
  
  
     'out': tf.Variable(tf.random_normal([n_hidden_2, n_classes])) 
  
  
  
  } 
  
  
  
  biases = { 
  
  
  
     'b1': tf.Variable(tf.random_normal([n_hidden_1])), 
  
  
  
     'b2': tf.Variable(tf.random_normal([n_hidden_2])), 
  
  
  
     'out': tf.Variable(tf.random_normal([n_classes])) 
  
  
  
  } 
  
  
  
   
  
  
  
  # Construct model 
  
  
  
  prediction = multilayer_perceptron(input_tensor, weights, biases) 
  
  
  
   
  
  
  
  # Define loss and optimizer 
  
  
  
  loss = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=prediction, labels=output_tensor)) 
  
  
  
  optimizer = tf.train.AdamOptimizer(learning_ratelearning_rate=learning_rate).minimize(loss) 
  
  
  
   
  
  
  
  # Initializing the variables 
  
  
  
  init = tf.initialize_all_variables() 
  
  
  
   
  
  
  
  # Launch the graph 
  
  
  
  with tf.Session() as sess: 
  
  
  
     sess.run(init) 
  
  
  
   
  
  
  
     # Training cycle 
  
  
  
     for epoch in range(training_epochs): 
  
  
  
         avg_cost = 0. 
  
  
  
         total_batch = int(len(newsgroups_train.data)/batch_size) 
  
  
  
         # Loop over all batches 
  
  
  
         for i in range(total_batch): 
  
  
  
             batch_x,batch_y = get_batch(newsgroups_train,i,batch_size) 
  
  
  
             # Run optimization op (backprop) and cost op (to get loss value) 
  
  
  
             c,_ = sess.run([loss,optimizer], feed_dict={input_tensor: batch_x,output_tensor:batch_y}) 
  
  
  
             # Compute average loss 
  
  
  
             avg_cost += c / total_batch 
  
  
  
         # Display logs per epoch step 
  
  
  
         if epoch % display_step == 0: 
  
  
  
             print("Epoch:", '%04d' % (epoch+1), "loss=", \ 
  
  
  
                 "{:.9f}".format(avg_cost)) 
  
  
  
     print("Optimization Finished!") 
  
  
  
   
  
  
  
     # Test model 
  
  
  
     correct_prediction = tf.equal(tf.argmax(prediction, 1), tf.argmax(output_tensor, 1)) 
  
  
  
     # Calculate accuracy 
  
  
  
     accuracy = tf.reduce_mean(tf.cast(correct_prediction, "float")) 
  
  
  
     total_test_data = len(newsgroups_test.target) 
  
  
  
     batch_x_test,batch_y_test = get_batch(newsgroups_test,0,total_test_data) 
  
  
  
     print("Accuracy:", accuracy.eval({input_tensor: batch_x_test, output_tensor: batch_y_test}))

【本文是專欄機(jī)構(gòu)“機(jī)器之心”的原創(chuàng)文章，微信公眾號(hào)“機(jī)器之心( id: almosthuman2014)”】

戳這里，看該作者更多好文

文章題目：利用TensorFlow和神經(jīng)網(wǎng)絡(luò)來(lái)處理文本分類問(wèn)題
路徑分享：http://www.5511xx.com/article/coipeig.html

日韩无码专区无码一级三级片|91人人爱网站中日韩无码电影|厨房大战丰满熟妇|AV高清无码在线免费观看|另类AV日韩少妇熟女|中文日本大黄一级黄色片|色情在线视频免费|亚洲成人特黄a片|黄片wwwav色图欧美|欧亚乱色一区二区三区

新聞中心

其他資訊