Mol-BERT: An Effective Molecular Representation with BERT for Molecular Property Prediction

<table class="table-group" id="tab2"><tr><td><table class="table"><tr><td class="thead-hr" colspan="2"><hr/></td></tr><tr class="thead"><td class="align_left">Parameter</td><td class="align_center">Value/range</td></tr><tr><td class="thead-hr" colspan="2"><hr/></td></tr><tr><td class="align_left">Learning rate</td><td class="align_center">1<span class="nowrap"><svg height="6.1673pt" id="M16" style="vertical-align:-0.2063904pt" version="1.1" viewbox="-0.0498162 -5.96091 5.50181 6.1673" width="5.50181pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M391 364C391 409 353 448 295 448C249 448 198 426 152 393C65 331 23 225 23 139C23 14 96 -12 146 -12C198 -12 280 9 367 101L351 124C300 78 242 48 194 48C129 48 109 107 109 162V191C208 213 391 266 391 364ZM313 350C313 305 268 261 113 223C132 334 187 381 217 398C227 404 244 405 261 405C290 405 313 385 313 350Z"></path></g></svg>-</span>5∼1<span class="nowrap"><svg height="6.1673pt" id="M17" style="vertical-align:-0.2063904pt" version="1.1" viewbox="-0.0498162 -5.96091 5.50181 6.1673" width="5.50181pt" xmlns="http://www.w3.org/2000/svg" xmlns:xlink="http://www.w3.org/1999/xlink"><g transform="matrix(.013,0,0,-0.013,0,0)"><path d="M391 364C391 409 353 448 295 448C249 448 198 426 152 393C65 331 23 225 23 139C23 14 96 -12 146 -12C198 -12 280 9 367 101L351 124C300 78 242 48 194 48C129 48 109 107 109 162V191C208 213 391 266 391 364ZM313 350C313 305 268 261 113 223C132 334 187 381 217 398C227 404 244 405 261 405C290 405 313 385 313 350Z"></path></g></svg>-</span>3</td></tr><tr><td class="align_left">Batch size</td><td class="align_center">8</td></tr><tr><td class="align_left">Epoch</td><td class="align_center">100</td></tr><tr><td class="align_left">Optimizer</td><td class="align_center">Adam</td></tr><tr><td class="align_left">Embedding dimension</td><td class="align_center">300</td></tr><tr><td class="align_left">Size of dictionary</td><td class="align_center">13,325</td></tr><tr><td class="align_left">Number of attention head</td><td class="align_center">6</td></tr><tr><td class="align_left">Layers of fully connected neural network</td><td class="align_center">6</td></tr><tr class="table-tr"><td colspan="2"><hr class="tbody-hr"/></td></tr></table></td></tr></table>

<div>The fine-tuning hyperparameters.</div>

Wireless Communications and Mobile Computing

tab2

Table 2

Table 2: Mol-BERT: An Effective Molecular Representation with BERT for Molecular Property Prediction