选择合适的预训练任务、模型框架和语料
选择合适的 layerEmbedding only; Top layer; All layers (更灵活的方式是像 ELMo 一样自动选择最好的层)
选择迁移方式(to tune or not to tune)Feature extraction: pre-trained parameters are frozenFine-tuning: pre-trained parameters are unfrozen and fine-tuned