regression.py 文件源码

python
阅读 28 收藏 0 点赞 0 评论 0

项目:course-1 作者: thoughtfulml 项目源码 文件源码
def __init__(self, csv_file = './data/king_county_data_geocoded.csv', data = None, values = None):
    if (data is None and csv_file is not None):
      df = pd.read_csv(csv_file)
      self.values = df['AppraisedValue']
      df = df.drop('AppraisedValue', 1)
      df = (df - df.mean()) / (df.max() - df.min())
      self.df = df
      self.df = self.df[['lat', 'long', 'SqFtLot']]

    elif (data is not None and values is not None):
      self.df = data
      self.values = values
    else:
      raise ValueError("Must have either csv_file or data set")

    self.n = len(self.df)

    self.kdtree = KDTree(self.df)
    self.metric = np.mean

    # TODO: set k to a number, try a few numbers out
    # self.k = None
评论列表
文章目录


问题


面经


文章

微信
公众号

扫码关注公众号